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PREFACE 

This is an introductory textbook on equilibrium statistical mechanics. 
It is called Statistical Thermodynamics to distinguish it from the author’s 
earlier work, Statistical Mechanics (McGraw-Hill, 1956). The latter is a 
treatise covering selected topics in detail and at a relatively advanced 
level; it is neither introductory nor primarily a textbook (except for a 
second or specialized course). The two volumes in fact complement each 
other. Statistical Thermodynamics (or some other introductory textbook, 
such as Rushbrooke’s) is prerequisite to Statistical Mechanics. Where 
there is overlapping (e.g., imperfect gases, distribution functions, Ising 
problem, etc.), the discussion given in the present work is more elementary, 
less general, and should serve to smooth the way for a more detailed study 
in Statistical Mechanics. 

Although introductory, Statistical Thermodynamics provides a quite 
extensive coverage of topics of current interest in equilibrium statistical 
mechanics. This is its principal justification for existence. However, non¬ 
equilibrium statistical mechanics has been omitted because: (a) the 
foundations are not yet established; (b) it is difficult to discuss this sub¬ 
ject on the elementary level used in the rest of the book; and (c) the author 
is not expert enough in this field, at least at the present time, to do it 
justice. The reader will find treatments of nonequilibrium problems in 
Kittel (see the list of references at the end of this Preface) and, much" 
more extensively, in a book by I. Prigogine.* 

Up to, say, 1945, the usual course in statistical thermodynamics in 
chemistry departments was concerned primarily or entirely with how to 
calculate thermodynamic functions of ideal gases from spectroscopic data. 
Such a choice of subject matter was not inappropriate, for this was the 
principal area of interest and research in statistical thermodynamics in 
the 1930’s. However, the author feels that a modem introductory course 
in statistical thermodynamics should reflect the developments of the 
1940’s and 1950’s. For this reason, the more traditional material (Chap¬ 
ters 4 and 8 through 10) referred to above is given a rather condensed 
treatment to provide room for a survey of more recent advances. 

The book is divided into four parts. Part I (Chapters 1 and 2) is con¬ 
cerned with the principles or postulates of statistical mechanics. The 
argument is based on elementary quantum-mechanical ideas such as 

* I. Prigooine, Nonequilibrium Statistical Mechanics. New York: Inter¬ 
science, 1961. 
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energy levels, states and eigenfunctions, degeneracy, etc. The principles 
of classical statistical mechanics are almost entirely omitted because these 
follow as a limiting form of quantum statistics and also because the classical 
theory is more sophisticated and difficult in many ways than the quantum 
approach (at least in the elementary form given here). The interested 
reader should consult the very elegant exposition of the principles of 
classical statistical mechanics given by Tolman (see the reference list 
below). Certainly any student who intends to pursue statistical mechanics 
beyond a first course must turn to a study of Tolman sooner or later, 
preferably sooner. 

Part II (Chapters 3-13) contains applications of the principles de¬ 
veloped in Part I to systems of independent molecules (e.g., an ideal gas) 
or of other independent subsystems. The more complicated but also more 
interesting problems which arise when molecules can no longer be treated 
as independent of each other (because of intermolecular forces) provide 
the subject matter of Part III (Chapters 14-21). 

Most of the applications in Parts II and III have to do with the classical 
(high-temperature) limit of quantum statistics. Part IV (Chapter 22) is 
concerned with problems for which the classical limit is not valid (e.g., 
helium gas at low temperatures). The sections of Chapter 22 could have 
been distributed among the appropriate chapters of Parts II and III 
(some instructors may in fact prefer to use this order), but the present 
arrangement has the advantage (for many students) that only the most 
rudimentary quantum-mechanical background is necessary in Chapters 
1-21. Somewhat more quantum mechanics is required in Chapter 22 but 
still not very much. 

The level of the first 16 chapters is fairly uniform. Of these, Chapter 1 
may seem the most difficult, at least on first reading, because it is neces¬ 
sarily the most general. It may be wise for the average student to return 
to this chapter for rereading after having acquired some familiarity with 
applications in later chapters. Parts of Chapters 17-22 are somewhat 
more advanced than the earlier chapters. For this reason, and because 
more recent contributions are involved, there are rather more references 
to the research literature in the last part of the book. 

Throughout the text, the intent, with each topic treated, is not to give 
a complete discussion that brings the reader up to the research frontier 
on the subject, but rather to give an introduction only. Usually this means 
that a somewhat approximate first-order theory is outlined. The most 
recent details and refinements are intentionally omitted to keep the book 
within reasonable compass and also because the level would otherwise be 
too advanced for a beginning textbook. As a next step in pursuing any 
given subject to a more advanced level, the student should consult the 
works listed at the end of each chapter and the literature references. 
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IX 


An author index has been omitted because many of the more advanced 
and important references have not been included in the text—for reasons 
just mentioned. In this connection an apology should be made for the dis¬ 
proportionate number of references to the author’s own papers. This is a 
consequence not of the relative importance of these papers but rather of 
their relative simplicity and of the author’s familiarity with them. 

The arrangement of chapters is such that a great deal of flexibility is 
possible in adopting the book as a text. As examples, we list below chap¬ 
ters and sections which might be used for (a) a one-semester graduate 
course in chemistry; (b) a two-semester graduate course in chemistry; 
and (c) a one-semester senior or graduate course in physics. 


Chemistry 
One Semester (3 hours) 
Chapter 1 

Sections 2-3 and 2-4 
Chapters 3 through 6 
Sections 7-1 and 7-2 
Chapters 8 through 11 
Sections 14-1, 14-4, and 
14-6 

Sections 15-1 and 15-2 
Chapter 16 
Section 17-1 
Section 18-1 
Section 20-1 
Sections 22-1 and 22-8 


Chemistry 

Two Semesters (3 hours) 
Chapters 1 through 16 
Sections 17-1, 17-2, and 
17-4 

Section 18-1 
Section 19-1 
Section 20-1 
Sections 21-1 and 21-2 
Sections 22-1 through 
22-4, and 22-8 


Physics 

One Semester (3 hours) 
Chapters 1-4 
Sections 22-1 through 
22-4 

Chapters 5 and 6 
Section 7-1 
Chapter 8 
Section 22-8 
Chapters 9 and 10 
Chapter 12 

Chapters 14 through 16 
Sections 17-1 and 17-2 
Section 18-1 


It will be obvious from the choice of subjects that the author has had 
physical chemists (and physical biochemists) particularly in mind in 
writing the book. However, as just indicated above, by suitable omissions 
and rearrangements one can easily use the text for an introductory course 
in statistical mechanics in a physics department. 

The reader is assumed to have studied thermodynamics, calculus, ele¬ 
mentary differential equations, and elementary quantum mechanics. 

The problems vary widely in difficulty. They range from simple nu¬ 
merical exercises to small-scale “research” problems. The first problems 
listed in each chapter are referred to, in passing, in the text. Many of 
these contain details that the author feels should not be spelled out in the 
text but that the reader should verify. 

The most important sources of reference for the student are the fol¬ 
lowing: 

Band, W., Introduction to Quantum Statistics. New York: Van Nostrand, 1955. 

Flory, P. J., Principles of Polymer Chemistry. Ithaca, N. Y.: Cornell, 1953. 



X 


PREFACE 


Fowler, R. H., and Guggenheim, E. A., Statistical Thermodynamics. Cam¬ 
bridge: 1939. 

Guggenheim, E. A., Mixtures. Oxford: 1952. 

Hildebrand, J. H., and Scott, R. L., Solubility of Nonelectrolytes. Third 
Edition. New York: Reinhold, 1950. 

Hill, T. L., Statistical Mechanics. New York: McGraw-Hill, 1956. 

Hirschfelder, J. 0., Curtiss, C. F., and Bird, R. B., Molecular Theory of 
Oases and Liquids. New York: Wiley, 1954. 

Kittel, C., Elementary Statistical Physics. New York: Wiley, 1958. 

Landau, L. D., and Lifshitz, E. M., Statistical Physics. Reading, Mass.: 
Addison-Wesley, 1958. 

Mayer, J. E., and Mayer, M. G., Statistical Mechanics. New York: Wiley, 
1940. 

Prigogine, I., Molecular Theory of Solutions. Amsterdam: North-Holland, 
1957. 

Rushbrooke, G. S., Introduction to Statistical Mechanics. Oxford: 1949. 

SchrOdinger, E., Statistical Thermodynamics. Cambridge: 1948. 

Slater, J. C., Introduction to Chemical Physics. New York: McGraw-Hill, 
1939. 

ter Haar, D., Elements of Statistical Mechanics. New York: Rinehart, 1954. 

Tolman, R. C., Principles of Statistical Mechanics. Oxford: 1938. 

Wilson, A. H., Thermodynamics and Statistical Mechanics. Cambridge: 1957. 

Note. A supplementary reading list is given at the end of each chapter. 
To avoid repetition, the above works are listed by authors only. The 
present writer’s earlier book is referred to throughout the present text 
as S. M. 

The author is greatly indebted to Dr. Dirk Stigter and Mr. Robert E. 
Salomon for reading the entire manuscript and making many helpful sug¬ 
gestions. Parts of the manuscript were also read and valuable criticism 
given by Professors George Pimentel, Nobuhiko Saito, and Tsunenobu 
Yamamoto. While writing the book, the author had the benefit of many 
stimulating discussions with Professors Saito and Yamamoto. Finally, 
the author wishes to express his appreciation for partial support from 
the Alfred P. Sloan Foundation during the period in which the book 
was written. 


February 1960 


T.L.H. 
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Part I 

Principles of Quantum Statistical Mechanics 





CHAPTER 1 


STATISTICAL-MECHANICAL ENSEMBLES AND 
THERMODYNAMICS 

I-I Introduction. The object of thermodynamics is to derive mathe¬ 
matical relations which connect different experimental properties of 
macroscopic systems in equilibrium—systems containing many molecules, 
of the order of, say, 10 20 or more. However useful, these interconnections 
of thermodynamics give us no information at all concerning the interpreta¬ 
tion or explanation, on a molecular level, of the observed experimental 
properties. For example, from thermodynamics we know that experi¬ 
mental values of the two heat capacities C p and CV for a given system must 
be interrelated by an exact and well-known equation, but thermodynamics 
is unable to furnish any explanation of why particular experimental values 
of either C p or CV, taken separately, should be observed. Such an explana¬ 
tion falls rather within the province of statistical mechanics or statistical 
thermodynamics, terms which we regard in this book as synonymous. 
That is, the object of statistical mechanics is to provide the molecular theory 
or interpretation of equilibrium properties of macroscopic systems. Thus the 
fields covered by statistical mechanics and thermodynamics coincide. 
Whenever the question “why?” is raised in thermodynamics—why, for 
example, a given equilibrium constant, Henry’s law constant, equation of 
state, etc., is observed—we are presented with a problem in statistical 
mechanics. 

Although thermodynamics itself does not provide a molecular picture 
of nature, this is not always a disadvantage. Thus there are many com¬ 
plicated systems for which a molecular theory is not yet possible; but 
regardless of complications on the molecular level, thermodynamics can 
still be applied to such systems with confidence and exactness. 

In recent years both thermodynamics and statistical mechanics have 
been extended somewhat into the nonequilibrium domain. However, the 
subject is new and changing, and the foundations are still a little shaky; 
hence we omit this area from our consideration. An exception is the 
theory of absolute reaction rates, which we discuss in Chapter 11. This 
approximate theory is based on a quasi-equilibrium approach which makes 
it possible to include the theory within the framework of equilibrium 
statistical mechanics. 

Aside from the postulates of statistical mechanics themselves, to be 
introduced in the next section, the foundation on which our subject is 
based is quantum mechanics. If we seek a molecular interpretation of the 

l 
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properties of a system containing many molecules, as a starting point we 
must certainly be provided with knowledge of the properties of the in¬ 
dividual molecules making up the system and of the nature of the inter¬ 
actions between these molecules. This is information which can in prin¬ 
ciple be furnished by quantum mechanics but which in practice is usually 
obtained from experiments based on the behavior of individual molecules 
(e.g., spectroscopy), pairs of molecules (e.g., the second virial coefficient 
of an imperfect gas), etc. 

Although quantum mechanics is prerequisite to statistical mechanics, 
fortunately a reasonably satisfactory version of statistical mechanics can 
be presented without using any quantum-mechanical concepts other than 
those of quantum-mechanical states, energy levels, and intramolecular 
forces. Only in Part IV pf the book is it necessary to go beyond this 
stage. 

Another very helpful simplification is that the classical limit of quantum 
mechanics can be used, without appreciable error, in most problems in¬ 
volving significant intermolecular interactions. Problems of this type 
are very difficult without this simplification (Part IV). 

Despite our extensive use of classical statistical mechanics in the appli¬ 
cations of Parts II and III, we introduce the principles of statistical me¬ 
chanics, beginning in the next section, in quantum-mechanical language 
because the argument is not only more general but is actually much 
simpler this way. 

1-2 Ensembles and postulates. As mentioned above, our problem is to 
calculate macroscopic properties from molecular properties. Our general 
approach is to set up postulates which allow us to proceed directly with 
this task insofar as "mechanical” thermodynamic properties are con¬ 
cerned; the "nonmechanical” properties are then handled indirectly by an 
appeal to thermodynamics. By “mechanical” properties we mean, for 
example, pressure, energy, volume, number of molecules, etc., all of which 
can be defined in purely mechanical terms (quantum or classical) without, 
for example, introducing the concept of temperature. Examples of “non¬ 
mechanical” thermodynamic variables are temperature, entropy, free 
energy (Gibbs or Helmholtz), chemical potential, etc. 

Let us consider-the pressure as a typical mechanical variable. In prin¬ 
ciple, if we wished to calculate the pressure in a thermodynamic system 
from molecular considerations, we would have to calculate (by quantum 
or possibly classical mechanics) the force per unit area exerted on a wall 
of the system, taking into account the change in the state of the whole 
system with time. The force itself would be a function of time. What we 
would need, therefore, is a time average of the force over a period of time 
sufficiently long to smooth out fluctuations, i.e., sufficiently long to give 
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a time average which is independent, say, of the starting time, t = l 0 , in 
the averaging. Because of the tremendous number of molecules in a typical 
system, and the fact that they interact with each other, such a hypothetical 
calculation is of course completely out of the question in either quantum 
or classical mechanics. 

Therefore we are forced to turn to an alternative procedure, the en¬ 
semble method of Gibbs, based on postulates connecting the desired time 
average of a mechanical variable with the ensemble average (defined 
below) of the same variable. The validity of these postulates rests on the 
agreement between experiment and deductions (such as those in this 
book) made from the postulates. So far, there is no experimental evidence 
available that casts doubt on the correctness of the postulates of statistical 
mechanics. 

Before stating the postulates, we must introduce the concept of an 
ensemble of systems. An ensemble is simply a (mental) collection of a very 
large number 91 of systems, each constructed to be a replica on a thermo¬ 
dynamic (macroscopic) level of the actual thermodynamic system whose 
properties we are investigating. For example, suppose the system of in¬ 
terest has a volume V, contains N molecules of a single component, and is 
immersed in a large heat bath at temperature T. The assigned values of 
N, V, and T are sufficient to determine the thermodynamic state of the 
system. In this case, the ensemble would consist of 91 systems, all of which 
are constructed to duplicate the thermodynamic state (N, V, T) and 
environment (closed system immersed in a heat bath) of the original 
system. Although all systems in the ensemble are identical from a thermo¬ 
dynamic point of view, they are not all identical on the molecular level. 
In fact, in general, there is an extremely large number of quantum (or 
classical) states consistent with a given thermodynamic state. This is to 
be expected, of course, since three numbers, say the values of N, V, and T, 
are quite inadequate to specify the detailed molecular (or “microscopic”) 
state of a system containing something in the order of 10 20 molecules. 

Incidentally, when the term “quantum state” is used here, it will be 
understood that we refer specifically to energy states (i.e., energy eigen¬ 
states, or stationary states). 

At any instant of time, in an ensemble constructed by replication of a 
given thermodynamic system in a given environment, many different 
quantum states are represented in the various systems of the ensemble. 
In the example mentioned above, the calculated instantaneous pressure 
would in general be different in these different quantum states. The 
“ensemble average” of the pressure is then the average over these instan¬ 
taneous values of the pressure, giving the same weight to each system, in the 
ensemble in calculating the average. A similar ensemble average can be 
calculated for any mechanical variable which may have different values 



4 


STATISTICAL-MECHANICAL ENSEMBLES 


[CHAP. 1 


(i.e., which is not held constant) in the different systems of the ensemble. 

We now state our first postulate: the (long) time average of a mechanical 
variable M in the thermodynamic system of interest is equal to the ensemble 
average of M, in the limit as 31 —> oo, provided that the systems of the ensemble 
replicate the thermodynamic state and environment of the actual system of 
interest. That is, this postulate tells us that we may replace a time average 
on the one actual system by an instantaneous average over a large number 
of systems “representative” of the actual system. The first postulate by 
itself is not really helpful; we need in addition, in order to actually compute 
an ensemble average, some information about the relative probability of 
occurrence of different quantum states in the systems of the ensemble. 
This information must be provided in a second postulate. 

Note that the ensemble average of M in the limit as 91 —» oo, referred 
to above, must be independent of time. Otherwise the original system 
which the ensemble “represents” is not in equilibrium. 

We shall work out details in this chapter for the three most important 
thermodynamic environments: (a) an isolated system (N, V, and E given, 
where E = energy); (b) a closed, isothermal system (N, V, and T given); 
and (c) an open, isothermal system (p, V, and T given, where p = chemical 
potential). N and p stand for the sets N lt N 2 , ... and p i} p 2 , ... if the 
system contains more than one component. Also, V might stand for a 
set of “external variables”* if there are more than one. The representative 
ensembles in the above three cases are usually called microcanonical, 
canonical, and grand canonical, respectively. The first postulate is applica¬ 
ble to all these cases and to other ensembles which will be introduced in 
Section 1-7. The second postulate, however, can be limited to a state¬ 
ment concerning only the microcanonical ensemble. The corresponding 
statement for other ensembles can then be deduced (as in Section 1-3, for 
example) from this limited second postulate without any further as¬ 
sumptions. 

Our second postulate is: in an ensemble (91 —> oo) representative of an 
isolated thermodynamic system, the systems of the ensemble are distributed 
uniformly, that is, with equal probability or frequency, over the possible 
quantum states consistent with the specified'values of N, V, and E. In other 
words, each quantum state is represented by the same number of systems 
in the ensemble; or, if a system is selected at random from the ensemble, 
the probability that it will be found in a particular quantum state is the 
same for all the possible quantum states. A related implication of this 
postulate, when combined with the first postulate, is that the single 
isolated system of actual interest (which serves as the prototype for the 


* There is one “external variable” for each thermodynamic work term, e.g., 
volume, area, length, etc. 
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systems of the ensemble) spends equal amounts of time, over a long period 
of time, in each of the available quantum states. This last statement is 
often referred :t© as the quantum “ergodic hypothesis,” while the second 
postulate by itself is usually called the “principle of equal a priori prob¬ 
abilities.” The ergodic hypothesis in classical statistical mechanics is 
mentioned at the end of Section 6-3. For a more detailed discussion, see 
Tolman, pp. 63-70 and 356-361. (For full identification of works referred 
to by only the author’s last name, see Preface.) 

The value of E in the second postulate must be one of the energy levels 
of the quantum-mechanical system defined by N and V. Since N is 
extremely large, the energy levels for such a system will be so close together 
as to be practically continuous, and furthermore, each of these levels will 
have an extremely high degeneracy. We shall in general denote the 
number of quantum states (i.e., the degeneracy) associated with the 
energy level E for a quantum-mechanical system with N and V by 
fl(W, V, E). Thus the number of “possible quantum states” referred to 
in the second postulate is Q. 

A complication in the above discussion is the fact that, from an opera¬ 
tional point of view, E cannot be known precisely; there will always be a 
small uncertainty SE in the value of E. For all thermodynamic purposes 
this complication is completely inconsequential.* Hence for the sake of 
simplicity we ignore it. 

It should also be mentioned that the point of view in the above state¬ 
ment of the second postulate is not so general as it might be. If the energy 
level E for the system N, V has a degeneracy Q, there are Q orthogonal 
(and therefore linearly independent) wave functions ^ which satisfy the 
Schrodinger equation 3C^ = Ef. The particular choice of the 0 ^’s is 
somewhat arbitrary, since other possible choices can always be set up by 
forming suitable linear combinations of the i£’s in the first choice. In any 
case, the "12 quantum states” mentioned in connection with the second 
postulate refers to some set of orthogonal ^’s all “belonging” to the same 
E. But regardless of the set of f’s chosen, the wave function representing 
the actual quantum-mechanical state of any system selected from the 
ensemble will in general not be one of the chosen set of ^’s, but will be some 
linear combination of all of them. The contrary is really implied in the 
above statement of the second postulate. Fortunately, this simplification 
in our statement of the postulate makes no differencef in any deductions 
we shall make that can be compared with experiment. 


* See S. M. (the present author’s earlier work identified in the Preface), 
p. 113, and Mayer and Mayer, pp. 55-56, 100-102. 
t See S. M., pp. 50-55, 79. 
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We turn now to a derivation from the above two postulates of the essen¬ 
tial properties of the canonical and grand ensembles. 


1-3 Canonical ensemble. The experimental system of interest here 
has a fixed volume V, fixed numbers of molecules N (which stands for 
N i, N 2 , ... in a multicomponent system), and is immersed in a very 
large heat bath at temperature T. The heat bath is assumed “very large” 
to be consistent with the use of the limit 31 —» oo below. Our first objective 
is to set up the machinery necessary for calculating the average value of 
mechanical variables, such as energy and pressure, in the system. In view 
of the first postulate, this means that we need to be able to calculate the 
ensemble average of such variables. This, in turn, can be done if we know 
the value of the particular variable in question in a given quantum state 
and the fraction of systems in the ensemble which are in this quantum state. 
It might be noted that because the thermodynamic system here is not 
isolated but is in contact with a heat bath, the energy of the system can 
fluctuate; therefore quantum states belonging to different energy levels E 
will have to be reckoned with. Since mechanical variables have well- 
defined values in a given quantum state (in fact we can use this property 
as the definition of a “mechanical variable”), the task that remains is to 
determine the fraction of systems in the ensemble in a given quantum state 
(or the probability that a system selected arbitrarily from the ensemble is 
in a given quantum state). This is the problem we now consider. 

The experimental, or prototype, system is in a very large heat bath at 
temperature T. Therefore each system in the ensemble representative of 
the experimental system must also be in a very large heat bath at T. 
Specifically, we contemplate the following arrangement, which satisfies 
this requirement. We imagine 31 macroscopic systems as our ensemble, 
each with N and V (duplicating the values in the experimental system), 
stacked together in a lattice (Fig. 1-1). The walls between the different 
systems in the ensemble are heat conducting, but impermeable to all 
molecules. To establish the temperature T, we imagine further that the 
entire stack of systems (i.e., the ensemble) is placed in a sufficiently large 
heat bath at T. After equilibrium is reached, thermal insulation (repre¬ 
sented schematically by the double lines in Fig. 1-1) is placed on the out¬ 
side walls of the ensemble, and the ensemble is removed from the heat 
bath. The entire ensemble itself is now an isolated system with volume 
317, numbers of molecules 3lN, and a total energy which we shall denote 
by E t (f = total). The relation between E t and the temperature T will 
emerge later. Observe that each system in the ensemble is immersed in a 
large (we shall later use the limit 31 —> oo) heat bath at temperature T, 
as is required if the ensemble is to be representative of the original thermo- 
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System 

with 

N and V 


Fra. 1-1. Canonical ensemble of 91 systems, each with N and F. 

dynamic system. That is, the remaining 91—1 systems in the ensemble 
serve as the heat bath for any one selected system. 

At this point we come to the essential step in the argument, which is to 
note that since the ensemble itself is an isolated system, we can apply the 
second postulate to the whole ensemble. Thus the entire canonical ensemble 
shown in Fig. 1-1 is now regarded as a prototype thermodynamic system, 
characterized by the variables 91F, tSlN, and E t . We shall refer to this 
system as a “supersystem” in order to avoid confusion with the original 
experimental closed, isothermal system. The second postulate tells us, 
then, that every possible quantum state of this supersystem (canonical 
ensemble) is equally probable and hence should be given equal weight in 
the calculation of average values of interest. As we show next, it is possible 
to take advantage of this observation in order to find the required prob¬ 
ability of occurrence of a given quantum state in the systems of a canonical 
ensemble. 

We return now to a single system in the canonical ensemble. As a quan¬ 
tum-mechanical system, it is characterized by N and F. Let us list all 
possible energy states for such a system in increasing order of the energy 

eigenvalue, E lt E 2 , ... ,Ej .Here, for later convenience, each state is 

listed separately so that when degeneracy occurs several successive Ej s 
will have the same value. For example, in the notation used above and to 
which we shall return later, the energy value E occurs successive times 
in the list. 

Each Ej is a function of N and F. If F is changed infinitely slowly, 
each Ej changes in a continuous manner. However, the number of mole¬ 
cules of any one of the components can be changed only discontinuously— 
one molecule at a time. Hence the energy levels must jump discontinu¬ 
ously if N changes. 
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For most systems containing many molecules, it is not possible, for 
purely mathematical reasons, to actually calculate the energies E\, E 2 , 
... from Schrodinger’s equation. But for generality, we assume in the 
present argument that these energies are known. In applications, we shall 
be able to make progress in each case only to the extent that we can over¬ 
come this difficulty either by approximation, or by use of classical me¬ 
chanics, or by reducing this many-body problem to a one-, two-, three-, 
... body problem, etc. In any case, the ensemble method has the follow¬ 
ing advantage over a direct time-average calculation on a single system 
(see Section 1-2): we need only the stationary states of the system and do 
not have to follow the change in state of the system with time. 

The list of energy eigenvalues E u E 2 , ... is assumed, then, to be the 
correct list for any given problem. The argument that follows is valid 
irrespective of such complications as intermolecular forces, symmetiy 
restrictions on wave functions, etc. 

Since each system in the canonical ensemble has the same N and V, 
all systems have the same set of energy states, represented by E x , E 2 ,..., 
Ej, .... Now suppose we observe, simultaneously, the energy state of 
each system in the ensemble, and count the number of systems found in 
each of the listed states. We let be the number of systems found in 
state Ey ,..., tij in state Ej, etc. The set of numbers m, n 2 ,... is called a 
“distribution.” There are, of course, many possible distributions that 
might be observed, but obviously all must satisfy the relations 

£«, = 91, (1-1) 

$ 

'£n i E j = E t . (1-2) 

i 

The individual systems in the supersystem (canonical ensemble) are 
macroscopic in size, are arranged in a certain order, and can be separately 
labeled. Then the energy state of the whole supersystem would be com¬ 
pletely specified if we indicated the system energy state (i.e., E\, E 2 , ...) 
for each of the (labeled) systems in the supersystem. To take a simple 
example, suppose there are four systems (A, B, C, D) in the supersystem 
(31 = 4) and the possible energy states for each system are E\, E 2 , and 
E 2 . Then one possible energy state for the supersystem would be, say, 

A B C D 

E 2 E 2 E 2 Ei 

provided that (compare Eq. 1-2) 


Ei + 2 E 2 + Ez = E t (preassigned). 
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Here »i = 1, n 2 = 2, n 3 = 1. Actually, there are 12 possible states of 
the supersystem consistent with this distribution. Three of them are: 


A B C D 

K 2 E 2 E 3 E\ 

E2 E 3 E 2 E\ 

E 3 i ? 2 U 2 Ei 


But there are four sets of this type, corresponding to the four possible as¬ 
signments of Ei. In general, the number of states of the supersystem, 
G t (»), consistent with a given distribution nj, n 2 , ... (n represents the 
entire set ni, n 2 , ...) is given by the well-known combinatorial formula 


G t(n) = 


(wi + ns + 
ni!n 2 ! • • 


•)! 


31 ! 

ILV 


(1-3) 


Recall that we are attempting to find the probability of observing a 
given quantum state (say Ej) in a system selected from a canonical en¬ 
semble (or the fraction of systems in the ensemble in the state Ej). For a 
particular distribution »t, n 2 , ..., this probability or fraction is just 
nj/31 for state Ej. But, in general, there are very many possible distribu¬ 
tions for given N, V, 31, and E t . What we need is the over-all probability; 
that is, an average of ray/31 over these distributions, based on an assign¬ 
ment of equal weight to each state of the supersystem. Assignment of 
equal weights to supersystem states implies immediately that the weight 
assigned to each distribution, in calculating an average over different dis¬ 
tributions, should be proportional to 0 ( (n) for the distribution. 

Now consider the numerical example above, and suppose further that 
there are just two distributions which satisfy the conditions of Eqs. (1-1) 
and (1-2), namely, 

ni = 1, « 2 = 2, n 3 =1, G ( = 12, 

n l = 2, n 2 = 0, n 3 — 2, 0 ( = 6. 


The probability of observing E% is £ in the first distribution ?nd £ in the 
second distribution, while the over-all probability is 

IX 12+ 2X6 4 U 3 1 

” 3 12 + 6 3 ’ 31 3 ' 


In general, the required probability of observing a given quantum state 
Ej in an arbitrary system of a canonical ensemble is 
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where n,(n) means the value of in the distribution n. The sum is over 
all distributions satisfying Eqs. (1-1) and (1-2). Of course, by definition, 
ZiPi= 1 . 

Then the desired ensemble averages of, for example, the energy and 
pressure are 

E = £ P i E > 

i 

and 

T> = £ PjPi, 
i 

where pj is the pressure in state Ej, defined by 

That is, —pj dV — dE , is the work that has to be done on the system, 
when in the state Ej, in order to increase the volume by dV. 

In principle, Eq. (1-4) for Pj tells us all we need to know to calculate 
canonical ensemble averages of mechanical variables. But in practice, a 
much more explicit expression for Pj is necessary. We must now face this 
problem. 

The most elegant way to proceed is to employ the Darwin-Fowler 
technique,* based on the mathematical method of steepest descents. 
However, in the present discussion, since we can take 91 —► oo, the so- 
called maximum-term method, which involves the use of undetermined 
multipliers, is equally rigorous though not so elegant. The latter method, 
which we shall use, has the important advantage of requiring much less 
of the reader in the way of mathematical background. 

In any particular case we are given 91, the Ej (determined by N and V), 
and E t (determined by 91, N, V, and T). There are then many possible 
distributions n consistent with the restrictions of Eqs. (1-1) and (1-2). 
For each of these distributions we can calculate from Eq. (1-3) the weight 
Q t («) to be used in obtaining averages, as already explained. The situation 
here parallels exactly that illustrated in Appendix II. That is, because of 
the large numbers involved (the present example is ideal in this respect 
because we can take the limit 91 —> oo), the most probable distribution, 
and distributions which differ only negligibly from the most probable 
distribution, completely dominate the computation of the average in 
Eq. (1-4). By the most probable distribution, denoted by n*, we mean of 
course that distribution to which the largest Q*(n) belongs. In effect this 
means that, in the limit as 91 — > oo, we can regard all other weights $2 t (n) 


* See, for example, Schr&dinger, Chapter 6. 


(1-5) 

( 1 - 6 ) 

(1-7) 
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n* 


Fig. 1-2. Number of states G< as a function of the distribution n (schematic). 


as negligible compared with G ( (n*). This is illustrated diagrammatically 
in Fig. 1-2. With 91 large but finite, there would be a narrow gaussian dis¬ 
tribution centered about n = n*. But in the limit as 91 —* oo, this dis¬ 
tribution becomes completely sharp (a Dirac 3-function). 

Naturally, as we let 91 -* oo (i.e., increase the size of the ensemble), 
holding N, V, and T fixed, each n,- —» oo also. But all ensemble averages 
depend only on the ratio »y/9l, which remains finite. 

Equation (1-4) becomes, then, 


1 fl f (w*)w* n*j 
91 91 G,(»*) 91 


( 1 - 8 ) 


where nj is the value of«/ in the most probable distribution, n*. Equation 
(1-8) tells us that in the computation of P, we can replace the mean value 
of nj by the value of n, in the most probable (largest G<) distribution. 
This leads us to a purely mathematical question: Which of all possible sets 
of n/s satisfying Eqs. (1-1) and (1-2) gives us the largest G«? 

We solve this problem by the method of undetermined multipliers 
(see Appendix III). The distribution giving the largest G ( is also the dis¬ 
tribution giving the largest In G ( , since In x increases monotonically with x. 
We work with In G ( instead of 0« because it is more convenient. From 
Eq. (1-3), 

In G,(n) = n^j In ~ 2 w *' ln w *> 


where we have used Stirling’s approximation (Appendix II) and changed 
the running index from j to i. This “approximation” is in fact exact here 
because we are interested in the limit 91, n t - —> oo. According to the method 
of undetermined multipliers, the set of n/s which leads to the maximum 
value of ln G<(n), subject to the conditions (1-1) and (1-2), is found from 
the equations 

^^lnG t (n) - « 2 2 = °> i = 1, 2, ..., 
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where a and 0 are the undetermined multipliers. On carrying out the dif¬ 
ferentiation, we find 



n*i = %e- a e- >E ‘\ j= 1,2, .... (1-9) 

This is the most probable distribution, expressed in terms of a and 0. 
If desired, 91 may be substituted for £,• n,- in In Q<(») at the outset, and 
treated as a constant in the differentiation. This will change the meaning 
of a, but not any physical results. 

The straightforward procedure here is to substitute the distribution 
(1-9) into Eqs. (1-1) and (1-2) in order to determine a and 0 as functions 
of 91 and E t , or of 91 and E (since obviously E t = 9 lE). The result is 


e“ 



E = 


Y,j Eje~ tBi 


( 1 - 10 ) 

( 1 - 11 ) 


where 91 has dropped out of both equations. Equation (1-11) provides 0 
as an implicit function of E (and also of N and V, since the energies Ej 
are functions of N and V). Equation (1-10) then gives a in terms of 0 
(and N, F). However, the independent variables of real interest here are 
N, V, T rather than N, V, E, and we have no information yet about the 
dependence of E on T. Hence we do not pursue the above approach any 
further (see Problem 1-2, however), but turn instead, in Section 1-4, to 
a thermodynamic argument which provides a direct connection between 
0 and T. 

We note in passing that elimination of e”“ in Eq. (1-9) by use of 
Eq. (1-10) (or comparison of Eqs. 1-5 and 1-11) gives us Py as a function 
of 0, N, and F: 



e -/»E,(N,F) 


j= 1,2,.... 


( 1 - 12 ) 


Anticipating the fact that 0 turns out to be a positive number, we deduce 
from this equation that the probability of observing a given quantum 
state in a canonical ensemble decreases exponentially with the energy of 
the quantum state. 


1-4 Canonical ensemble and thermodynamics. To bring nonmechanical 
thermodynamic variables such as temperature and entropy into our dis- 
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cussion, we now combine the above “mechanical” considerations with 
thermodynamics. In the first place, by virtue of the first postulate, we can 
associate the thermodynamic pressure p and energy E with the statistical- 
mechanical ensemble averages JS and E. Let us take the differential of E 
in Eq. (1-5), holding N constant (the system being closed): 

dE - '£E j dP j + Y l P i dE j 

3 3 

= - | E On Pi + ^ Q) dPi + 2 p i (W)„ dV ’ (1 “ 13) 

where we have defined 

q = E (1-14) 

i 

Used Eq. (1-12) in the first sum, and have recognized in the second sum 
that Ej(N, V) can vary only with V if N is fixed. The first sum simplifies 
further in view of the relations 

2 p i=l> 2 dP i = °> 

3 3 

and 

<*(2 p > lnP >) = 2 lnp i<* p >- 

Thus, using Eq. (1-6), we can write 

- | d ^2 p i ln P j) = dE + fdV. (1-15) 

Since we already have the associations with thermodynamics E *-*"E and 
p *-* JJ, and since in thermodynamics (N constant) 

TdS= dE + pdV, 

we can deduce from Eq. (1-15) the further association 

TdS^- jjd^Pyln P>) . (1-16) 

With these associations established, let us digress to note that from 
Eq. (1-13) and 

dE = DQ* — DW, 
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we have 

DQ*= TdS H^^dPy, 

i 

(1-17) 


DW = pdV £ Py dE jt 

(1-18) 


i 


where Q* and W are heat absorbed and work done by the system, respec¬ 
tively. These relations provide us, in a general way, with the molecular 
interpretation of the thermodynamic concepts of heat and work. We see 
that when a closed thermodynamic system increases its energy infinitesi¬ 
mally by the absorption of heat from its surroundings, this is accomplished 
not by changing the energy levels of the system but rather by a shift in 
the fraction of time the system spends in the various energy states. The 
converse statement can be made about the work term. 

We now return to the main argument, the purpose of which is to relate 
S to the Pj. From Eq. (1-16), 

dS <-> -y, dG, (1-19) 

where G is defined by 

G -gP,lnP,. 

i 

From thermodynamics we know that the left side of Eq. (1-19) is an exact 
differential. Hence the right side must be also. This condition will be met 
provided that 1/fiT is any function of G, say <p(G). That is, 

dS «-» <p(G) dG = df(G), (1-20) 

where i 

f(G)=j<p{G)dG, „«7) = *. 

From Eq. (1-20), 

S <-> f(G) + c, (1-21) 

where c is an integration constant independent of G and therefore inde- 



AB 


Fio. 1-3. Systems A and B combined to form AB. All systems are at 
same temperature. 
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pendent of the variables on which G depends (e.g., 0 and V, with N 
constant). In thermodynamic language, c is independent of the thermo¬ 
dynamic state of a closed system. Experimental information about the 
entropy always involves a difference in entropy between two states (e.g., 
the entropy change AS between T i and T 2 at constant N and V), never an 
absolute value. The constant c in Eq. (1-21) always cancels on taking 
such a difference. Hence its value is completely arbitrary from an opera¬ 
tional point of view. But for convenience and simplicity, we adopt the 
particular choice c = 0 from now on. The connection between this choice 
and the third law of thermodynamics will be discussed in Section 2-4. 

Up to this point we have that S <-> f(G), but we do not know the func¬ 
tion /. To settle this matter we make use of a thermodynamic property 
of the entropy, namely its additivity. Specifically, suppose we have two 
thermodynamic systems A and B at the same temperature and with en¬ 
tropies Sa and Sb- Then if we regard the combined systems (Fig. 1-3) as 
a new system AB, we have Sab = Sa + Sb- This relationship suffices 
to determine /, as we now show. 


A 

B. 

A 

B 

B 

A 

B 

A 

A 

B 

A 

B 

B 

A 

B 

A 


Fig. 1-4. Canonical ensemble of 91 systems, each of type AB. 

We first investigate whether the statistical-mechanical quantity G is 
additive in the above sense. For this purpose we form a canonical ensemble 
of 91 systems AB (as shown in Fig. 1-4) representative of a thermodynamic 
(prototype) AB system at temperature T. Heat can flow through all in¬ 
terior walls of the ensemble. The A part of the thermodynamic system 
is characterized further by N A and V A , and the B part by N B and V B 
(A and B are not exponents). In general, the types of molecules may be 
different in A and B. We have two sets of energy states for the separate 
systems, E A , E A , ■.. and Ef, E B , .... If nf stands for the number of 
A systems in the ensemble in state Ef, with a similar meaning for nf, 
then the number of states of the whole ensemble (Fig. 1-4), or super- 
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system, consistent with given distributions n A and n B is 
° t(n ’ n } - n t nfl X Jhnfl 


( 1 - 22 ) 


since the A and B systems are independent of each other (except for 
energy exchange through the walls). The distributions of interest must 
satisfy the equations 

E n * = 9l > E n ? = 91. 


E + W ?^ B ) = p ‘- 


The argument from here on is essentially the same as before, so we 
omit details (Problem 1-3). The three restrictions above require three 
undetermined multipliers, a a, as, and j8, respectively. We note in par¬ 
ticular that because of energy exchange between the A and B systems, 
only one energy equation and one multiplier 0 are necessary. For the 
probability that the thermodynamic system AB has its A part in state 
Ef and its B part in state Ef, we find 



This multiplicative property of Pa is of course what we should expect 
from the form of Eq. (1-22). We deduce from Eq. (1-23) that if two sys¬ 
tems are in thermal contact with each other (and therefore have the same 
temperature), they have the same 0. This suggests a close connection 
between 0 and T, which we verify below. 

For the combined system AB, 

Gab -E p .> ln P <> 

= - E P * P f ( In P < + ,n P f) 

uj 

= - E P i ln P i - E P f ln P f 

i $ 


= G a + G b . 


( 1 - 24 ) 
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That is, 6 is additive. Also, since Sab = Sa + Sb, we have 
KGab) = /«?a) +/«?*). 

Then, from Eq. (1-24), 

KGa + Gb) = KG a ) + f(G B ). 

The question before us becomes, then: Given that 

/(* + y) =/(*) +f(y), (1-25) 

what is the function /? Let us differentiate* Eq. (1-25) with respect to 
x and y: 

df(x + y) d(x + y) dfjx + y) dfjx) 

d(x + y) dx d(x + y) dx 

dfjx + y) d(x + y) _ dfjx + y) _ dfjy) 
d(x + y) dy d(x + y) dy 

Hence 

dfjx) _ dfjy) 
dx dy 

This says that a certain function of x is equal to the same function of y. 
But this is only possible if the function is a constant, say k. Then 

= k, f(x) = kx +a, 


where a is another constant. But we have to choose a = 0 in order to 
satisfy Eq. (1-25). Therefore, finally, we have found that f(x) = kx, 
and that 

S+->f(G) = kG 





kT 


f}. 


(1-27) 


* This argument is from Schrodinger, p. 13. 
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The constant k is still unevaluated at this stage. We have seen that if 
any two systems are in thermal contact, they have the same 0 and T. 
Therefore they have the same k. What is more, k is a universal constant, 
since one system of the pair, say A, can be retained and B can be varied 
over all other possible systems, C, D, E,... . The value of k can thus be 
obtained once and for all by comparing statistical-mechanical and ex¬ 
perimental values of the same property, on any convenient system {A, 
above). The pressure of an ideal gas is usually used. The numerical value 
of k depends, of course, on the absolute temperature scale employed. We 
anticipate from our treatment of an ideal gas in Chapter 4 that k = 
+1.38044 X 10 —16 erg • deg -1 , with the conventional kelvin tempera¬ 
ture scale. However, the important fact that k is a positive number can 
easily be checked here in several ways. For example, if we put fi = 1/kT 
in Eq. (1-11), differentiate with respect to T, and use the experimental 
thermodynamic fact that Cv = (dE/dT)s,v is always positive, we find 
that k must be positive (Problem 1-4). 

We are now in a position to summarize the basic statistical-mechanical 
equations that can be used to calculate the thermodynamic properties of 
a closed, isothermal system. In the first place, the probability that the 
system is in any particular energy state Ej is 

(1 - 28) 

wh er6 

Q{N, V, T) = £ e -BjiN.VUkT (1-29) 

j 

We call Q the “canonical ensemble partition function.” Because of the 
association (1-27), the independent thermodynamic variables here turn 
out to be N, V, and T, which is just the desired set for a closed, isothermal 
system (see Section 1-3). The entropy is 

S(N, V, D = ~k 2 Pi ^ Pi, (1-30) 

i 

where Pj is given by Eq. (1-28). If we substitute Eq. (1-28) into 
Eq. (1—30), we find 

S = 1 + fclnQ = f ~ 

where the last expression is a thermodynamic one (A is the Helmholtz 
free energy). Therefore 

A(N, V, T) = -kT\nQ(N, V, T). (1-31) 

This equation is particularly useful because A is the “characteristic func- 
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tion” in thermodynamics for the independent variables N, V, T: 

dA = -SdT - pdV + £ /*« dN a . (1-32) 


Thus, 



(1-33) 

(1-34) 

(1-35) 


Hence, if the function Q(N, V, T) is available from Eq. (1-29), differentia¬ 
tion of Q yields S, p, and E. Furthermore, despite the fact that Eq. (1-31) 
was derived from the study of a closed system, we can make use of the 
thermodynamic equation (1-32) and Q(N, V, T) to deduce the chemical 
potential of any component, say i, from 



(1-36) 


Thus we have a complete set of thermodynamic functions (from which 
all others can be derived): N, V T; A, E, S, p, p. Incidentally, whether 
the averaging bars over E and p in the above equations are dropped or 
not is optional; it depends on whether one has in mind primarily the 
thermodynamic or the statistical-mechanical aspect of the equation in 
question. 

The above equations, which allow us to deduce all the thermodynamic 
properties from Eq. (1-29) for the partition function Q, are general but 
quite formal. In fact, the reader may feel that these relations are rather 
useless since, in general, the Ej must be expected to be very difficult to 
calculate for a system with many molecules. While such an attitude is 
perhaps justified in complicated cases, there are many systems for which 
considerable progress of one kind or another can be made. Much of the 
rest of this book will be devoted to such examples. 

For many purposes it is convenient to group together all energy states 
belonging to the same energy level. Let Qi(N, F) be the number of such 
states (that is, the degeneracy) for an energy level Ei(N, V). In other 
words, in the list of energy states E u E 2 , ..., the same value Ei occurs 12, 
times. Then, 


Q(N, F, T) 


£ -E^N.vmT = £ Q . (N) V)e -E i( N,vmT' 

i i 

(states) (levels) 


(1-37) 



20 


STATISTICAL-MECHANICAL ENSEMBLES 


[CHAP. 1 


Also, 

Qe- EliT 

P (level) = UP (state) = -=-q — (1-38) 

is the probability that the system exists in the energy level E. We have 
dropped subscripts here to avoid confusion between i and j. Whether a 
sum such as one of those occurring in Eq. (1-37) is over “states” or “levels” 
can always be judged by noticing whether or not degeneracies are included 
as weights for the so-called Bpltzmann factors (e~ E > lkT ). 

We have already mentioned that Pj, being proportional to the Boltz¬ 
mann factor e~ B i liT , falls off exponentially with increasing Ej. We shall 
discuss essentially this point in more detail in Chapter 3, but in anticipa¬ 
tion we should mention here two important extreme cases: 

(a) If T —* 0 and the lowest level E x is nondegenerate, then 

Q —> e~ ElliT [l + U 2 e~ (Ea ~ El)lkT H-] —► e~ BllkT 

and 

P x -* 1, Pj 0, j= 2,3,.... 

That is, in the limit as T —♦ 0, the system is certain to be found in the 
lowest energy state. From Eq. (1-30), S —► 0. 

(b) If T —> oo, the relative effect of different Ej ’s on the Boltzmann 
factors is washed out, and Pj (state) —> constant (independent of j); that 
is, the probability distribution over states becomes uniform. Then S—*ao, 
assuming that there is an infinite number of energy states (Problem 1-5). 

1-5 Grand canonical ensemble. In this section we suppose that the 
thermodynamic system of volume V, whose properties we wish to calcu¬ 
late from molecular considerations, is in a large heat bath and is “open” 
with respect to the molecules in the system. That is, both heat and matter 
(molecules) can be transported across the w;alls of the system. The bath 
provides a reservoir of heat at temperature T and of molecules at chemical 

potentials pi, ju 2 >- The system is thus characterized by the thermo- 

dymamic variables V, T, n x , p 2 , ■ .. . The numbers of molecules N x , N 2 , 
... do not have fixed values, as they do in a closed system, but fluctuate 
about mean values fl 2 ,.... 

We employ here the same type of argument as for the canonical en¬ 
semble: (a) the first postulate permits us to use ensemble averages over 
mechanical variables in place of time averages on the actual system; 
(b) by regarding the entire ensemble as an isolated supersystem, we can 
deduce ensemble-average weighting (probability) factors from the second 
postulate, in terms of undetermined multipliers; and (c) the significance 
of the undetermined multipliers, as nonmechanical variables, can then be 
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Fig. 1-5. Grand canonical ensemble of 91 systems, each with F. 

established by comparing statistical-mechanical and thermodynamic 
expressions for mechanical variables. 

For simplicity, we consider now a one-component system, with V, T, and 
H given. As an ensemble (which we call a “grand canonical ensemble”) 
representative of this system, we introduce a lattice (Fig. 1-5) of 91 sys¬ 
tems, each with volume F and with walls permeable to molecules (indi¬ 
cated by the dashed lines) and to heat. To establish the desired values of 
T and p in each of the 91 systems, we imagine that the whole ensemble is 
immersed in a giant reservoir at T and p until equilibrium is reached. We 
then place walls around the ensemble (solid double lines in Fig. 1-5) 
that are impermeable to both heat and molecules, and finally remove the 
ensemble from the reservoir. The ensemble itself is then an isolated super¬ 
system to which the second postulate can be applied. The volume of the 
supersystem is 91F, and we let E t and N t be its total energy and number of 
molecules. 

Since the subsequent details are very similar to those in Sections 1-3 
and 1-4, we condense the discussion here. For each value of N, there will 
be a different set of energy states Ej(N, F). The quantum-mechanical 
state of the supersystem (ensemble) is specified when we give the value 
of N and the state Ej(N, V) for each system in the supersystem. In a 
given state of the supersystem, let nj(N) be the number of systems which 
contain N molecules and are in the particular energy state Ej{N, F). 
N can range from zero to infinity (unless there is some upper limit set by 
the model being used). For a given distribution », that is, for a set of 
numbers 

«i(0), 

Wi(l), n 2 (l), 7 * 3 ( 1 ),..., 

»i(2), n a (2), n 8 (2),..., 
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the number of possible quantum states of the supersystem is 


q<(») = 

Acceptable distributions must satisfy the conservation relations 


2>,(A0 = 91, 
i.if 

'£n J (N)Ej(N, V) = E„ 

j.if 

X nj(N)N = Nf 

i.lf 


(1-39) 


(1-40) 

(1-41) 

(1-42) 


If we let a, 0, and 7 be the respective undetermined multipliers, the most 
probable distribution turns out to be (Problem 1-6) 


n*j(N) = fJLe- a e- 0B i {N ' V) e- yN . (1-43) 

Again, in principle we can substitute Eq. (1-43) into Eqs. (1-40) through 
(1-42) and find a, 0, and 7 as functions of 31, E t , and N t . But, instead, 
we follow a procedure analogous to that used for the canonical en¬ 
semble. 

From Eqs. (1-40) and (1-43), 


and 


e'= £ e-W^N.V^-yN 
i.N 


(1-44) 


PAN) = 


_ nAN) _ n*(N) _ e -^N,V)--,N 


91 


91 


E e-'W- y>e~ rlf ' ’ t 1-45 ) 


where PAN) is the probability that a system selected at random from the 
grand ensemble will contain N molecules and be in the energy state 
Ej(N, F); or, PAN) is the probability that the single prototype thermo¬ 
dynamic system contains exactly N molecules and is in the energy state 
Ej(N, F). We note that PAN) has an exponential dependence on both 
Ej(N, V) and N. An open system has a definite volume, but both the 
energy and number of molecules in the system fluctuate. In a closed, 
isothermal system (canonical ensemble), N is fixed but the energy fluctu¬ 
ates. The magnitude of these fluctuations will be examined in Chapter 2. 

From the first postulate, we have the associations 


E «-» E(= Et/fSl) = X) PAN)EAN, F), 

j.N 


(1-46) 
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N ~ 37(= N t / 91) = £ Pj(N)N, 

j.N 


(1-47) 


These are mechanical variables. To include nonmechanical variables and 
to evaluate 0 and 7, we utilize the expression 

d'E = £ V) dP s {N) + X) P/W dP/(AT, V). (1-49) 

j.N j.N 

Since we are summing over all values of j and N, Ej(N, V) is in effect a 
function of V only in the second term on the right. In the first term, we 
substitute for Ej(N, 7) from Eq. (1-45). Then, 

dE “ ~ 4 E W + ln W) + In B] dP,(N) 

P j.N 

+ 'E P jW aEi % V) dV, (1-50) 


Using [Eq. (1-47)] 


Eq. (1-50) simplifies to 


_ ^ e ->*jlN.V) e -yN 
j.N 


dN - '£,NdP j (N), 

j.N 


We compare this with the thermodynamic equation 


and conclude that 


TdS = dE + pdV - itdN, 


r 


(1-51) 


- id[EPiM 1" PjW)] - dE + p dV + JdF. (1-5 


(1-53) 


(1-54) 


T dS ~ | d [g FXW) ln P,(W)]. (1-55) 


By the same kind of lengthy argument already employed for the canoni- 
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cal ensemble, we arrive at the further results 

S <-* —k^ Pj(N) In P,(N), (1-56) 

i.N 

~ A (1-57) 

and therefore, from (1-54), 

jgi ~ -*• (1-58) 

The relation (1-56) has the same formal appearance as Eq. (1-30) in the 
canonical ensemble. In fact, this form for S is quite general (see Problem 
1-8, for example). 

According to Eq. (1-45), there is only one /3 (and therefore k) for all 
values of N. Furthermore, this is the same fi as in Section 1-4 for a closed, 
isothermal system, since a grand ensemble is just an aggregate of canoni¬ 
cal ensembles. That is, we can imagine “freezing” the composition of the 
systems in a grand ensemble by suddenly inserting, between the systems, 
walls which are heat conducting but impermeable to molecules. Then the 
original grand canonical ensemble becomes simply a collection of canonical 
ensembles (in fact, this is the significance of the word “grand”) in thermal 
contact with each other, each characterized by a definite N. 

Let us now summarize results for an open, isothermal system whose 
thermodynamic state is specified by the variables V, T, and tx. The prob¬ 
ability that such a system contains N molecules and is in the energy state 
Ej(N, V) is 

-E,(IV.V)/tT NflkT 

P}(E;V, T, M ) = - g {V} Ttlx) -> (1-59) 

where 

E(F, T, m) = E e ~ E > l ' N ' V)lkT e N,lliT . (1-60) 

i.N 

We call E the “grand partition function.” The notation used for P in 
Eq. (1-59) means that N and j are essentially running indices (the notation 
Pn} might have been used), while V, T, and n are independent thermo¬ 
dynamic variables. An alternative form for E is 

S(F, r, m) = E E e- B ^ N - VUkT 

n L i 

= 'Z.QiN, V, T)e Nf,kT . 

N 


(1-61) 
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The probability that the system has N molecules, irrespective of the 
energy state, is 

P(N; V, T, M ) = 2 = —s tv Srfli ” 

Thus, for example, the average value of N is 


.Nn/tr 


(1-62) 


m v, T, p) = ■ (1-63) 

If we substitute Eq. (1-59) into Eq. (1-56), we find 

s = | — T f + fcbl2 = | — 7^ + ?’ 
where the last expression is thermodynamic in origin. Hence 

pV = kT)n 3 (V, T, p). (1-64) 


Now pV is the thermodynamic characteristic function for the variables 
V, T, and p: 

d(pV) = SdT + Ndp + pdV. (1-65) 


Therefore, from Eq. (1-64), we have the following relations which, to¬ 
gether with Eq. (1-64), permit us to calculate all the thermodynamic 
properties of a system if E is known as a function of V, T, and p: 


S = 

AT = 

p = 



( 1 - 66 ) 

(1-67) 

( 1 - 68 ) 


The last form of Eq. (1-68) follows from Eq. (1-64) or, on thermodynamic 
grounds, from the fact that the variables held constant in the derivative 
are both intensive. 

We shall see in Chapter 2 that one can choose an ensemble from which 
to calculate thermodynamic functions on the basis of convenience, and 
irrespective of the actual environment of a system (heat bath, constant 
pressure, etc.). In many problems the grand ensemble is easier to use than 
the canonical ensemble. When this is the case, the reason is usually either 
(a) that a mathematically awkward restraint of constant N in the canoni¬ 
cal ensemble can be avoided by summing over N (Eq. 1-61), or (b) that 
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a many-body problem can be reduced to a one-body, two-body, etc., 
problem by viewing Eq. (1-61) as a power series in the “absolute activity” 
X = e l ‘ ,kT : 

H (7, T, ft) = <m 7, T) + Q(l, 7, T)\ + Q(2, 7, T)\ 2 + • • •. (1-69) 

This is the preferable method in treating an imperfect gas, for example 
(Chapter 15). 

The above discussion is limited to a one-component system, but it can 
easily be extended to any number of components (Problem 1-7). For 
example, for two components there will be two equations like (1-42) and 
two undetermined multipliers, 7j = —pi/kT and 72 = —p 2 /kT. Equa¬ 
tion (1-59) becomes 

P i( N it N 2 ; 7, T, mi, Ms) = f, ft^l) ’ ^ 

with 

S= 2 „ N 2 , 7, (1-71) 

»i.nr* 

where 

X! = e^' kT , X a = e** lkT . 

p7 = kT In S (7, T, Ml, Ms), (1-72) 

d(pV) = SdT + Ni dm + N 2 dp 2 + p dV, (1-73) 

from which we can immediately write the extensions of Eqs. (1-66) 
through (1-68). 

1-6 Microcanonical ensemble. Here we are concerned with an isolated 
system with given E, 7, and N (N again represents a set N i, N 2 , ... if 
the system is multicomponent). The representative ensemble is called a 
microcanonical ensemble, as stated in Section 1-2. For an isolated system 
it is difficult to achieve a direct connection between our two postulates 
and thermodynamics (e.g., we have used variations in 2? for this purpose 
in Sections 1-4 and 1-5, but here E is constant). The most common pro¬ 
cedure for avoiding this difficulty is to introduce, essentially as a new 
postulate, the equation S — k In Q, where U(N, 7, E) is the degeneracy 
of the energy level E (see Section 1-2). However, a new postulate is not 
really needed; its introduction is therefore unsatisfactory from a logical 
point of view. Instead, we derive the properties of a microcanonical en¬ 
semble from either the canonical ensemble or the grand ensemble. 

First, consider a canonical ensemble. A microcanonical ensemble, as the 
name is meant to imply, is a degenerate canonical ensemble in which all 
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systems have (virtually) the same energy. Thus, suppose we start with a 
canonical ensemble, pick out just those systems with an energy level E, 
place thermal insulation around each of them, and then remove these 
systems from the other systems in the canonical ensemble (with energies 
different from E). As a result of this operation, we have a collection of 
isolated systems, all with the same N, V, and E (a microcanonical en¬ 
semble). This degenerate canonical ensemble 4 ' may be thought of as being 
representative of a hypothetical closed, isothermal system that is somehow 
restrained from having values of E other than E — E. Another way of 
saying this is that the only quantum states accessible to the system are 
those with energy E. In this new ensemble, according to Eq. (1-28), the 
fraction of systems Pj in a given quantum state (energy E) is proportional 
to e~ BliT . But E is the same for all quantum states, fl(N, V, E) in number. 
Hence Pj is the same for all Q quantum states. Since Pj = 1 , Pj = 
1/Q. Then, from Eq. (1-30), 

S(N, V, E) = Pj In Pj = -Ml (i In 

= klna(N,V,E). (1-74) 

This relation between the thermodynamic S and statistical-mechanical 
(actually, quantum-mechanical) Q can then be employed to derive all 
thermodynamic functions of interest, as we shall see below. 

A microcanonical ensemble is also a degenerate grand ensemble: we 
can pick out of a grand ensemble only those systems with certain pre¬ 
scribed values of N and E. But there is a different, in fact, complementary, 
way in which Eq. (1-74) can be deduced from a grand ensemble. 

In what follows, we have to make use of Section 1-5, which was re¬ 
stricted for simplicity to a one-component system, but the method and 
result are independent of the number of components. In Section 1-5 we 
applied the second postulate to the whole grand ensemble, or supersystem 
(Fig. 1-5). That is, the supersystem itself is an example of an isolated 
system. The point of view we adopt here is that the supersystem may be 
regarded, for present purposes, not as an imaginary construct but as a 
single, very large, real isolated system. In this case, the dashed lines in 
Fig. 1-5, dividing the “supersystem” into “systems,” represent mathe¬ 
matical rather than physical planes. A “system” (Fig. 1-6) is then an 
imaginary macroscopic portion, between mathematical planes, of the 
total or “supersystem.” Each “system” is open and isothermal. Note that 


* We shall see in Section 2-2 that, because relative fluctuations in E about 
Sin a canonical ensemble are virtually negligible in magnitude, this somewhat 
artificial way of forming a microcanonical ensemble is really unnecessary: 
canonical and microcanonical ensembles are essentially indistinguishable in any 
case. 
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System 
with V 


Fig. 1-6. Portion of a larger system forming a smaller, open, isothermal 
system of volume V. 


we cannot use this point of view in connection with a canonical ensemble, 
because this ensemble requires that each “system” be closed. The way we 
proceed is to employ Eq. (1-39) to find the total number of quantum 
states ft ( of the “supersystem” (specified E t , N t , V t = 91F) and relate 
this number to the entropy St of the “supersystem” using (a) the connec¬ 
tions with thermodynamics already found for an open, isothermal system 
(Eqs. 1-59 through 1-68), and (b) the additive property of the entropy, 
S t — 915, where S is the entropy of one “system” in the “supersystem.” 
To evaluate In ft t , we start with 


In fi« = In ^ Q<(») = In Q*(n*) 
» 


and then substitute 


»*(A0 = 


^-B^NiV)lkT e N^lkT 

s 


) 


(1-75) 

(1-76) 


which follows from Eqs. (1-45) and (1-59), into In G t (n) from Eq. (1-39), 
to give In ft«(»*). We find 

In 0{(n*) = 91 In 91 — ^ n*(N) In n*(N) 
i.ff 


9l2? 91?7 m 

= w-~w + 911,12 

~ \kT kT ^ kT/ ~ k 

so that 

S t = k In 0«, 


in agreement with Eq. (1-74). 


(1-77) 
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Equation (1-74), due to Boltzmann, is possibly the best-known equa¬ 
tion in statistical mechanics, mainly for historical reasons. We shall dis¬ 
cuss it further in Chapter 2. But let us note here that: (a) it applies to 
an isolated system; (b) if the ground state of the system is nondegenerate, 
12 = 1 and S = 0 (T —»0); and (c) for any isolated system whatever, the 
more quantum states available to the system, the higher the entropy. This 
is the origin of qualitative statements which correlate the entropy with 
“probability,” “randomness,” “disorder,” etc. We shall encounter many 
examples of such a correlation in the present book. Incidentally, we can 
appeal to thermodynamics for a quick estimate of the magnitude of 12 
to be expected in statistical mechanics in general. That is, since it is found 
experimentally that* S = 0(Nk), In 12 = O(N), and 12 == 0(e N ) = 
0(10’° 2 ), an impressively large number. 

Let us assume that, from quantum mechanics, we have 12 (N, V, E) for 
the system of interest. Our next problem is to calculate all thermodynamic 
functions, not just the entropy S. But S is the characteristic function for 
the variables N, V, E : 

dS = j, dE + I dV - £ ^ dN a . (1-78) 

a 

Hence 


1 

/d In 12 n 


(1-79) 

kT ~ 

\ dE / 

) > 

' v.x 

II 

(d In 12 n 
\ dV j 

) > 

' E,N 

(1-80) 

ii 

(d In & 
\ dNt > 


(1-81) 


The temperature is determined by the dependence of 12 on E. Since we 
know from thermodynamics that T is positive, we can anticipate that 12 
will increase with E for any macroscopic quantum-mechanical system. 
Clearly, the same statement can also be made about 12(F). 

In practice, except in very simple systems, 12(IV, F, E) is not available, 
and the microcanonical equations (1-74) and (1-79) through (1-81) can¬ 
not be utilized. In particular, the restriction to constant energy E is 
usually a difficulty. This can be avoided by passing to the canonical en¬ 
semble. 


* The notation 0(x) means “of order x. 
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1-7 Other ensembles. Many other ensembles and partition functions 
are possible and are often useful. For example, for a one-component sys¬ 
tem, if we start with Q(N, V, E) and then sum one at a time over E, N, 
and V, we obtain* the four relations (we write 1/kT = p here for con¬ 
venience), 

In Sl(N, V,E) = pTS = |» (1-82) 

In Y a ( N > v > = function of N,V,p = -pA, (1-83) 

E 

In Y Q(N, V, E)e tpN = function of V, E, fin = pH, (1-84) 

N 

In Y 0(y, V, E)e~ tpr = function of N, E, pp = p(TS - pV). (1-85) 
v 

In Eq. (1-84), H is the “heat content,” E + pV. The summations over 
possible values of E and V may be replaced by integrations in most prob¬ 
lems, as we shall see. But here we use the present simple notation for 
convenience. The sum in Eq. (1-83) is the same, except for notation, as 
the sum over energy levels in Eq. (1-37). Previously we established only 
Eqs. (1-82) and (1-83), but the other two cases, and those below, can 
be worked out in detail by the same general methods already used 
(Problem 1-8). 

Continuing, we can also sum two at a time over E, N, and V: 

In Y V, E)e~ 0B e s “ N = function of V, P, p = PpV, (1-86) 

BM 

In Y V, E)e~ fB e~ tpV — function of N, p, p 

B.V 

= — PNp = -PF, (1-87) 

In Y Q (N, V, E)e SpN e~ >pV — function of E, Pp, pp = PE. (1-88) 

N.K 

Equation (1-86) is the logarithm of the grand partition function, already 
encountered. The other two equations are new. Equation (1-87) is par¬ 
ticularly importantf because it is applicable to a system with the familiar 


* T. L. Hill, J. Chem. Phya. 29, 1423 (1958). See also A. Munster (Supple¬ 
mentary Reading list); W. B. Brown, Mol. Phya. 1, 68 (1958); A. Munster, 
ibid. 2 ,1 (1959); R. A. Sack, ibid. 2, 8 (1959). 
t W. B. Brown, Mol. Phya. 1, 68 (1958). 
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set of independent variables N, T, and p. In Eq. (1-87), F is the Gibbs 
free energy, A + pV, equal to Np for a one-component system. Finally, 
we can sum over all of E, N, and V: 

In Y, v > E)e-^ B e^ N e-' ,pV = function of P, p, p = 0. (1-89) 

B.N.V 

This is an exceptional case, since T, p, and p appear to be independent 
variables, whereas we know from thermodynamics that at most two of 
these variables can be independent. The special treatment necessary for 
this partition function is provided elsewhere (S. M., Chapters 2 and 3). 

The characteristic functions are redefined here, in a more systematic 
way, as dimensionless quantities. In every case there is an appropriate 
thermodynamic equation which permits calculation of other thermody¬ 
namic functions from knowledge of the partition function. For example, 
for Eq. (1-83), 

d{-pA) = -EdP + fipdV - PpdN, (1-90) 

or for Eq. (1-87), 

d(—fiF) = —Edfi — Vd(fip) - ppdN. (1-91) 

Equation (1-90) is, of course, just a rearrangement of Eq. (1-32) for a 
closed, isothermal system. 

The reader has perhaps noticed that the characteristic function can be 
written immediately on inspection of the partition function. The rule is: 
if we replace Q by e ffTS , then the characteristic function is the sum of the 
exponents in the partition function. For example, for Eqs. (1-83) and 
(1-87), respectively: 

1 3TS - fiE = -pA, 

(1-92) 

18 TS -PE - PpV = —pF. 

The reason for the existence of this rule will be obvious from our discussion 
of the thermodynamic equivalence of ensembles in Chapter 2. It depends 
on the legitimacy of replacing the logarithm of a partition function by the 
logarithm of its maximum term. 

There are two further types of ensembles or partition functions we 
should mention, since they will be encountered in the applications. First, 
there are some problems in which the “external variable” V is replaced by 
another external variable (e.g., length or area) or is supplemented by addi¬ 
tional external variables (e.g., volume and area are both external vari¬ 
ables). Second, in multicomponent systems, there are cases in which it is 
helpful or necessary to regard the system as open with respect to some 
components, but not all. 
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Problems 

1-1 .Modify the derivation, in Section 1-3, of the canonical ensemble probability 
distribution (1-12) to obtain Eq. (1-38) directly. That is, use “levels” instead of 
“states.” 

1-2. Obtain the derivatives (dT2/dV)t,tt and (df/dfi) N y from Eqs. (1-5), 
(1-6), and (1-12), and show that 

(SL 

This result is of interest because comparison with the thermodynamic equation, 



suggests that jS = constant/T, as we find in Section 1-4. (Page 12.) 

1-3. Derive the canonical ensemble probability distribution (1-23) for a 
combined system, starting with Eq. (1-22) and the associated restraints. 
(Page 16.) 

1-4. Make use of Eq. (1-11) for "E, the relation /S = 1/kT, and the experi¬ 
mental fact that Cv is always positive to prove that k is a positive constant. 
(Page 18.) 

1-5. Use the method of undetermined multipliers to show that —51/ P/ In P/, 
subject to the condition 51/ P/ = 1, is a maximum when P/ = constant. 
(Page 20.) 

1-6. Use the method of undetermined multipliers to find the most probable 
distribution in a grand ensemble, Eq. (1-43). (Page 22.) 

1-7. Derive the basic equations (1-70) and (1-72) for an open system with 
two components. (Page 26.) 

1-8. Derive Eq. (1-84) for a system of fixed volume in thermal and material 
contact with a reservoir at T and p, but with the transports of heat and mole¬ 
cules across the walls of the system coupled in such a way as to maintain E 
constant. (Page 30.) 

1-9. Verify that Eqs. (1-34) and (1-35) for p and E in terms of Q are equiv¬ 
alent to Eqs. (1-5), (1-6), and (1-28) in terms of ensemble averages. 

Supplementary Reading 

Kittel, Part I. 

Mayer and Mayer, Chapters 3, 4, and 10. 

Munster, A., Handbuch der Physik, Vol. III/2, pp. 176-412. Berlin: 
Springer, 1959. 

Munster, A., Statistische Thertnodynamik. Berlin: Springer, 1956. 

SchrSdinger, Chapters 1, 2, and 6. 

S. M., Chapters 2 and 3. 

ter Haar, D., Revs. Mod. Phys. 27, 289 (1955). 

Tolman, Chapters 13 and 14. 

Wilson, Chapter 5. 
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CHAPTER 2 

FURTHER DISCUSSION OF ENSEMBLES 
AND THERMODYNAMICS 

2-1 Fluctuations. The general approach in Chapter 1 was, first, to 
set up postulates from which a procedure could be worked out for calcu¬ 
lating the mean values of mechanical thermodynamic variables, such as 
energy and pressure; then, to extend the treatment to include non¬ 
mechanical variables, such as temperature and entropy, by comparing 
corresponding statistical-mechanical and thermodynamic equations. Thus 
we now have at our disposal a set of general equations that can be employed 
to accomplish our primary task, namely, the calculation of thermodynamic 
functions from molecular properties. Beginning with the next chapter, 
we shall encounter many illustrations of such calculations. 

For the strictly thermodynamic purposes referred to above, only the 
mean values of mechanical variables need be considered. Yet any student 
of statistical mechanics will or should be curious about the extent to which 
mechanical variables fluctuate around their mean values. For example, 
in a closed, isothermal system (canonical ensemble), the system has a 
certain probability of being found in any given energy state E jy though 
the average value of the energy is 2?. What is the dispersion or spread of 
the energy probability distribution about 2?? However, curiosity does 
not provide the only motive for investigating fluctuations. The matter 
is of some practical interest as well. For example, (a) a statistical-me¬ 
chanical investigation of fluctuations justifies or explains, theoretically, 
why it is possible and legitimate to ignore fluctuations in thermodynamics; 
(b) a study of fluctuations leads us to the important conclusion [essentially 
equivalent to (a)] that in the calculation of thermodynamic properties 
of macroscopic systems, we can choose a statistical-mechanical ensemble 
to work with strictly on grounds of convenience, ignoring the actual kind 
of environment (isolated; closed, isothermal; etc.) the system of interest 
is in; (c) some important topics, such as light scattering and the theory 
of solutions, can be analyzed explicitly in terms of fluctuations; and finally, 
(d) fluctuation theory is very important in nonequilibrium statistical 
mechanics. 

Note that we have not mentioned fluctuations in nonmechanical vari¬ 
ables. The reason for this is the following. To define the mean value 
(or examine the extent of fluctuations about the mean value) of a prop¬ 
erty of a system which can exist in various states j with probabilities Pj, 
the property itself must be well defined in each state j. As was mentioned 
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at the beginning of Section 1-3, a property that meets this criterion is 
“mechanical,” by definition. For example, Ej, pj = —dEj/dV, N, and F 
are all well defined for a single quantum state, but S and T are not. 
Hence we can discuss fluctuations in E, N, etc., but not in S, T, etc.* 

We have remarked that thermodynamic functions calculated in statis¬ 
tical mechanics turn out to be independent of the ensemble used in the 
calculation. This is not true, however, of fluctuations. For each environ¬ 
ment (and therefore for each ensemble) the problem is different; in fact, 
even the variables which fluctuate are different. Hence, in a study of 
fluctuations, the actual environment of the system of interest must be 
taken note of and the appropriate ensemble used. For example, if the 
system is closed and isothermal, there will be fluctuations in p and E, 
but not in N and F (since values of N and V are prescribed and fixed), 
and these fluctuations must be investigated using a canonical ensemble. 

Each environment has a corresponding ensemble and partition function: 
eight possibilities are listed in Section 1-7 for a one-component system 
with a single external variable (F). It is clear then that there must be a 
great many fluctuation formulas which might be derived and discussed 
(see Problem 2-1, for example). We confine ourselves here to perhaps the 
three most important special cases (for a one-component system). 

Let us first consider the fluctuation in the energy of a closed Bystem 
immersed in a large heat bath (IV, F, T given and fixed). Because F and 
N are fixed, fluctuations in energy must be associated with heat exchange 
between system and bath. We use the canonical ensemble, of course. 
Now, as we shall soon verify, these energy fluctuations turn out to be very 
small indeed, so the probability distribution function for different energies 
is gaussian in form about the mean value 7? (Fig. 2-1). The dispersion 
or spread in this probability distribution may therefore be characterized 
completely by the standard deviation <Te, that is, the root mean-square 
deviation from the mean: 

<Te — [(# — $) 2 ] . ( 2 - 1 ) 

Before evaluating <Te explicitly, we note that 

(E - E) 2 = E 2 - 2 EE + (E) 2 = E 2 - (E) 2 . (2-2) 

This is necessarily a positive quantity. 

If we differentiate 

E e-W-WT = Ej(N, V)e~ £ > l/f ' V)lkT (2-3) 
i i 

* A less restricted point of view is taken, for example, by Landau and Lifshitz, 
Chapter 12. 
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Fio. 2-1. 
(schematic). 


Probability distribution in energy for a closed, isothermal system 


with respect to T, and then divide by Q, we find 


(1 _i_ % V _ 1 V 1 e-2 L-B,lkT 

\dT/r,if + QkT* ^ E ’ ~ QkT* Ej 


or 


E 2 - (E) 2 = (E - f) 2 = <r| = fcr 2 Cy. 


(2-4) 


From thermodynamics, we know that in general Cv — 0(Nk) and E = 
0{NkT). Therefore 


ffE = (kT 2 Cy) 112 

E E 


0(N~ m ). 


(2-5) 


Thus we have found that in a typical closed, isothermal system the 
standard deviation of the energy probability distribution is of the order 
of 10~ 10 E, an extremely small quantity. The probability distribution is 
therefore practically a Dirac 5-function at E — E. It is for this reason 
that in thermodynamics all values of E other than E can, in effect, be 
ignored. 

It may be helpful to look at these energy fluctuations from a more 
explicit and slightly different point of view, in a special case. According 
to Eq. (1-38), the probability Pe (Fig. 2-1) of observing a certain energy 
E is proportional to Q(AT, V, E)e~ B,kT . In order to have a concrete ex¬ 
ample before us, let us anticipate from Eq. (4-39) that the energy de¬ 
pendent factor in Q for an ideal, classical, monatomic gas is E 3NI2 . As 
mentioned in Section 1-6, the fact that 0 always increases with E can 
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be seen from Eq. (1-79) (T is positive). We can write, in our special case, 

P E = cx 3l,l2 e~ x , 

where x = E/kT and c is independent of x (we need not specify c more 
closely here). In the neighborhood of the most probable energy P*(=E), 
x is clearly of order 2V, since E* = O(NkT). Hence one of the factors 
involving x in Pe above is extremely large and increasing, and the other 
extremely small and decreasing. It is this feature which causes P B to have 
such a sharp maximum. To investigate the maximum, we need 


dlnP* 32V * _ 32V 

-di- = 0= 2^ -1 ' * -T’ 


d 2 \nP B 
dx* 


32V _ _ 2_ 
2x2 3 N ’ 


when x = x*. 


Then from 

to P, - In P, + i -»•)* + -, 

we find 


Pe = 

where 

_ 2 3 N(kT) 2 
<te - 2 - 


This is the conventional form for a gaussian distribution in terms of the 
standard deviation of the distribution. This result for agrees with 
Eq. (2-4), since Cv = ZNk/2 in this special case (Chapter 4). We also 
note that E* ~ x*kT = ZNkT/2, which is correct. 

Next we examine the fluctuation in N in an open, isothermal system. 
Here V, T, and are fixed. We differentiate 


7f £ Q(N, V, T)e Ifx,kT = 2 NQ(N, V, T)e N “ lkT (2-6) 

N N 


with respect to p, divide by S, and obtain 


or 


m - <fb‘= = kT (^) r , 


FelkT 


(2-7) 
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Since, in thermodynamics, p = 0(kT), cr% = 0(37) and <Tk/77 = 
0(F _1/2 ). This is the same order of magnitude for the relative sfluctua- 
tion as we found above for the energy of a closed, isothermal system. 
Indeed, it is the standard result in statistical-mechanical fluctuation 
formulas. We conclude that even an open system contains virtually a 
fixed number of molecules, 37, for given V, T, and p: the probability 
distribution in N is practically a 8-function. 

The right side of Eq. (2-7) can be put in more familiar thermodynamic 
form. From 

dfi = v dp (T constant), 

we have 

where v = V/N and p = N/V. Then 



Now we put Eq. (2-8) in Eq. (2-7) and find 

_ UTk 

\x) ~ v’ 

where k, the compressibility, is defined by 

V\dp/N,T 

For an ideal gas, k = 1/p and kTic/V = 1/77. 

V is constant, Eq. (2-9) is also a formula for the fluctuation in number 
density (p = N/V): 


(2-9) 


In view of the fact that 


(N - 77) 2 _ (p - P) 2 UY = kTjc 
(F) 2 (P) 2 \P/ V ' 


( 2 - 10 ) 


As our third case, let us investigate a system with N, p, and T fixed, 
but with fluctuations in volume. According to Eqs. (1—87) and (1-91), 
we have as the appropriate starting point, 


V A = ^ W, V, De~* VlkT , (2-11) 

v 
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where 

A = £ il(N, V, E)e- BlkT e~ pVlkT 
bTv 

= EQ(V, V, De~ prikT . (2-12) 

v 

That is, the probability of observing the volume V is 

Pv _ w.v,r»- v r . (2 _ 13) 

On differentiation of Eq. (2-11) with respect to p, and division by A, 
there results 

( _V* vOf~ pVlkT _— V V i Otr pVlkT 

\dp) N .T Afcry g “ A*ry KQ 

or 

(D*_ (2-14) 

Hence the formula here for oy/F is the same as that for <t n /N in an open 
system (Eq. 2-9). 

Finally, we mention an exception to the negligible fluctuations we have 
been encountering above. Consider Eq. (2-9), for example. At a critical 
point or when two phases exist together in the system, (dp/dV) NtT is 
essentially zero and k infinite. Hence the fluctuations are large rather 
than negligible. To be a little more specific and exact, in a two-phase 
system w ith the two number densities pi and p 2 , p in the expression 
(p — p) 2 can range from p t to p 2 (rather than being centered very closely 
around p), so that (p — p) 2 is now of the order of (p) 2 itself. Hence 
a,/p in Eq. (2-10) is of order unity rather than N~ 112 . That is, the 
standard deviation is of the same magnitude as the mean value itself. 

Fluctuations in density at the critical point are responsible for the 
well-known critical opalescence phenomenon. 

2-2 Thermodynamic equivalence of ensembles. The formulas derived 
in the preceding section are typical in showing that fluctuations about 
mean values are so small they can be ignored in thermodynamics except 
under very special circumstances (even in a two-phase system, fluctua¬ 
tions within each phase separately are normal, Le., thermody na mically 
negligible). For example, in the grand partition function 

S - 2 Q(N, V, T)e N - lkT (2-15) 

N 
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for a one-component open system, the only values of N which have an 
appreciable probability of being observed experimentally are those that 
deviate only negligibly from the mean value 7?. Therefore, just as we 
found in Appendix II, we can replace In £ by the logarithm of the largest 
term in the above sum, without making a detectable error to terms 
of thermodynamic order of magnitude. To be more specific: both 
In £ (=pV/kT) and the logarithm of the largest term in the sum are of 
order 27, and Eq. (2-7) tells us that there are 0(27 I/2 ) terms in the sum 
of the same order of magnitude as the maximum term; hence the situa¬ 
tion is completely analogous to that in Appendix II [see Eq. (II—11)]. 

Let us proceed to find the maximum term in £, for the result is rather 
interesting. Let 

ty(V, T, p) = Q(N, V, T)e N “ lkT . 

Then 



If we denote by N* the value of N satisfying Eq. (2-16), ( v * is the 
maximum term in £. Then 

InE = g = In ty = In Q(N*, V,T)+^, 
or 

N*p - pV = A(N*, V, T) = —kT In Q{N*, V, T). (2-17) 

Equation (2-16) determines N* (=27) as a function of V, T, and p. But 
if we take the alternative point of view that the independent variables 
are ~N*, V, and T, and that Eq. (2-16) gives p as a function of N*, V, 
and T, tJhen Eqs. (2-16) and (2-17) are just the canonical ensemble 
equations (1-36) and (1-31), respectively. In other words, application 
of the maximum-term procedure, which is legitimate because of the small 
fluctuations in N, causes the grand ensemble to degenerate into the 
canonical ensemble. This conclusion can be verified by noting further 
that the grand ensemble equations for 5 and p [(1-66) and (1-68)] also 
go over into the corresponding canonical ensemble equations [(1-33) and 
(1-34)] when In £ is replaced by In ty* (Problem 2-2). 

For practical thermodynamic purposes, then, there is no distinction 
between a grand ensemble and a canonical ensemble. In a given prob¬ 
lem one can choose between them simply on the basis of mathematical 
convenience. 

Let us digress to note that we can write, as in Eq. (II—8), 
t N = Q(N, V, T)e N ' lkT = tye- 1 "-"''' 13 *", 
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where <Tn is given by Eq. (2-7). The grand partition function sum can 
then be replaced by an integral: 

In S = In + In f*” e~ x l2vif dx 
J —00 

= In <N* + In [(2ir) 1/2 <Ttf], 

where x = N — N*. The second term is of order In 7J, which is negli¬ 
gible compared with the first term. 

The argument above can be applied to any of the partition functions, 
(1-83) through (1-88). As one further example, observe the degeneration 
of the canonical ensemble into the microcanonical ensemble: 

In Q = In ^ t B — In t B *, 

B 

where 

t B = 0(1V, V, E)e' ElkT . 

Then E*(N, V, T)(=T2 ) is found from 

a In t E = 0 = (WA _ J_ 
dE \ dE )n.v 

Also, 

\nQ= -±= InQ (N, V, 
or 

S = fclnO(AT, V,E*). 

Equations (2-18) and (2-19) will be recognized as microcanonical ensemble 
relations. 

In thermodynamics the functional relations between the thermodynamic 
variables of a system are independent of the environment (open, closed, 
isobaric, isothermal, etc.). Another way of saying this is that the choice 
of independent thermodynamic variables is arbitrary and not prescribed 
by the environment. In statistical mechanics, just as we should expect, 
we have now come to the same conclusion: regardless of environment, we 
can select whichever ensemble or partition function (and therefore inde¬ 
pendent thermodynamic variables) we wish in calculating thermodynamic 
properties; the results must be independent of the choice. 

Since the entropy is a particularly important and interesting thermo¬ 
dynamic function, we add a few comments concerning it. The canonical 
ensemble expression for S, Eq. (1-30), is 

S = -k J Pi In Pj. 

i 


(2-18) 


(2-19) 


( 2 - 20 ) 
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Now we have found that in a closed, isothermal system, the probability 
that the system has an energy departing appreciably from E is virtually 
zero. Hence we may assume that the only P/s in Eq. (2-20) which 
make a significant contribution to the sum are those associated with 
E (=E*, above). The number of quantum states with energy E is 
Sl(N, V, E), so that each P, = l/Sl(N, V, E) on this assumption. Then 

= fc In Q(N, V, E), (2-21) 

in agreement with Eq. (2-19). This confirms our assumption. The 
reader will have noticed that this argument is practically the same as 
that used in deducing Eq. (1-74) as the starting point for a discussion 
of the microcanonical ensemble. But the point of view here is quite 
different. In Section 1-6 we had to either mentally pick out those sys¬ 
tems in a canonical ensemble with a certain E, or artificially limit the 
accessibility of quantum states to those with energy E. But we have 
found from our study of fluctuations in this chapter that these restricting 
devices are really unnecessary: a canonical ensemble (in fact, any en¬ 
semble) is, so to speak, by its own choosing virtually a microcanonical 
ensemble. 

Similarly, in the grand ensemble relation (Eq. 1-56), 

S= P,(N) In Pj(N), (2-22) 

>.N 

the only important Pj(N)’s are those associated with E and 77. The 
number of these states is 0(77, V, E). Then 

S = kin 0(77, V, E). (2-23) 

Thus the relation «S = k In 0 (N, V, E) has quite general validity. It 
is restricted in principle but not in practice to isolated systems. But we 
must take for N, V, and E in Q(JV, V, E) the mean or most probable 
values (when fluctuations are possible), which are themselves functions 
of the independent variables of the system. With this understanding we 
can make a general correlation between the magnitude of S and the num¬ 
ber of available quantum states 0. The greater the degree of randomness 
or disorder in a system, the larger 0 and therefore S. We shall encounter 
many explicit examples in later chapters, but we can anticipate, on 
qualitative grounds, that, for example: fi and the entropy increase in the 
sequence crystal —»liquid —» gas for the same substance at the same 
pressure; the entropy of a group of molecules is less than the entropy of 
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the dissociated atoms from which the molecules are made; molecules with 
small vibrational force constants (“weak interatomic springs”) will have 
higher entropies than those with large force constants; a group of gas 
molecules occupying a large volume has a higher entropy than when oc¬ 
cupying a small volume; etc. 

2-3 Second law of thermodynamics. In this section we discuss the 
second law of thermodynamics from the standpoint of statistical mechanics. 
First we consider an isolated .system. 

If an isolated system changes its thermodynamic state infinitesimally, 
the change in entropy in the process satisfies the relation dS > 0. This 
is one way to state the second law of thermodynamics. The inequality 
holds for any spontaneous, (irreversible) change. After the isolated sys¬ 
tem has exhausted all possible spontaneous changes (each with dS > 0) 
available to it, the entropy will have reached a maximum value. A further 
infinitesimal change in state will now be reversible, and the equality will 
hold, dS = 0. Similarly, in any finite spontaneous change in thermo¬ 
dynamic state (i.e., both initial and final states are equilibrium states) 
in an isolated system, AS > 0. It is this inequality in particular which 
we examine. 

Probably no system is ever in complete equilibrium with respect to 
all possible processes. In practice, we require in thermodynamics only 
that the system be in equilibrium with respect to those rate processes 
with half-lives short compared with the time available for an experiment. 
Processes so slow as to proceed to a negligible extent during a thermo¬ 
dynamic measurement present no complication and can be ignored, even 
though the system is not in equilibrium with respect to such processes. 
For example, in the absence of a catalyst, the equilibrium properties of 
a mixture of nitrogen and hydrogen gases can be studied without taking 
into account the possibility of formation of ammonia molecules, because 
the rate of formation of these molecules from nitrogen and hydrogen is 
extremely slow. In quantum-mechanical language, we would say that 
only those quantum states associated with the presence of only nitrogen 
and hydrogen are, in fact, accessible states; states corresponding to the 
existence of some ammonia molecules in the system are inaccessible. In 
effect, there is a restraint (of chemical kinetic origin, in this case—an 
activation energy or, rather, activation free energy) in operation which 
limits the accessibility of quantum states. 

However, if we start in this example with a mixture of nitrogen and 
hydrogen gases at equilibrium in an isolated system and then add a 
small amount of catalyst (not considered part of the system, for sim¬ 
plicity): (a) some ammonia molecules will be formed spontaneously; 
(b) the states accessible to the system are now extended, by removal of 
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Adsorbing 

Barrier Barrier surface 




Fig. 2-2. (a) Gas confined to one side of container by a removable barrier, 
(b) Gas restrained from contact with adsorbing surface by removable barrier. 

the restraint,* to include a group of formerly inaccessible states as well 
as the originally accessible ones; and (c) after the new equilibrium point 
is reached, now including the chemical equilibrium, we must have AS > 0 
for this spontaneous process, according to the second law. 

Quite generally, any spontaneous thermodynamic process in an isolated 
system can be viewed in this way: the initial equilibrium state of the 
system includes a certain set of accessible quantum states but excludes 
other (inaccessible) quantum states; the spontaneous process leading to 
the final equilibrium state occurs because of, or is made possible by, the 
removal of one or more restraints, which makes an additional group of 
quantum states accessible to the system. Further examples are: (a) a gas 
is confined to one side of a barrier (Fig. 2-2a), and the barrier (restraint) 
is then removed; and (b) gas molecules are initially prevented by a barrier 
from coming in contact with an adsorbing surface (Fig. 2-2b), but then 
the barrier is removed. 

The removal of a restraint can probably always be analyzed in terms 
of the reduction or obliteration of an activation energy (i.e., potential 
barrier) or free energy. Thus the function of a catalyst for a chemical 
reaction is to reduce the activation free energy of the reaction (see 
Chapter 11). In cases (a) and (b) in the previous paragraph, the physical 
barrier presents an essentially infinite potential barrier to the passage of 
molecules, which becomes zero when the physical barrier is taken out. 

Suppose an isolated system is characterized by N, V, E (N might be 
a set of numbers of molecules and V a set of external parameters) and in 
its initial state has accessible to it 12(2V, V, E) quantum states. Further, 


* We have not defined the word "restraint” because we are using its conven¬ 
tional meaning. "Removal of restraint” always implies that the number of 
accessible quantum states increates. Mayer and Mayer (pp. 81-85) use the word 
"inhibition” instead of “restraint.” 
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let the removal of a restraint initiate (on the thermodynamic level) a 
spontaneous process and make available (on the molecular level) addi¬ 
tional quantum states so that the total number of states becomes 
Q'(N, V, E). That is. O' — £2 new states are added to the original G. 
Necessarily, O' > 0. Since S = fc In SI in the initial equilibrium state 
and S' = k In Q' in the final equilibrium state, we have, for the spon¬ 
taneous process, 

AS = S' - S = k\n^ > 0. (2-24) 


This is the statistical-mechanical version of the second law of thermo¬ 
dynamics for an isolated system. It is beautifully simple, following directly 
from the ideas of the number and accessibility of quantum states. 

It is interesting to calculate from the above considerations the prob¬ 
ability of the spontaneous occurrence in an isolated system of a process 
with a negative entropy change, in contradiction to the second law of 
thermodynamics. For example, suppose we have gas occupying the 
whole container (O' quantum states) in Fig. 2-2 (a) and we wish to calcu¬ 
late the probability of observing the gas occupying (spontaneously) only 
the left half of the container (G quantum states). From thermodynamics 
we know that the entropy change in this process for an ideal gas of 
N molecules is —Nk In 2. Now in accordance with the notation of the 
preceding paragraph, of the total fi' quantum states accessible when the 
whole container is available to the gas molecules, 0 of these states corre¬ 
spond to molecules actually being present only in the left half of the 
container. Thus, since all G' quantum states have equal probability of 
occurrence, the probability that all molecules will be found in the left half 
is O/G'. If AS represents the (positive) entropy change for the process 
“left half” —* “whole,” then (Eq. 2-24) 


a 

o' 


,-tsik 

K t 


AS > 0 . 


(2-25) 


For an ideal gas AS = Nk In 2, and the probability of observing a viola¬ 
tion of the second law in this case is n/Q' = 1/2 N , a very small number 
of order l/lO 10 ’ 0 . This result can also be deduced from simple prob¬ 
ability considerations: the probability that any one of the N molecules 
will be found on the left is 1/2, so the probability that all will be found 
on the left is (1/2) N (ideal gas). 

Equation (2-25) is, of course, a general equation applicable to other 
cases in which the second law would be violated. But as significant 
experimental values of AS are always of order Nk, the probability $1/0' 
is always very small, of order e~ N . That is, a nonnegligible entropy in¬ 
crease always corresponds to a tremendous increase in the number of 
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accessible quantum states: 12' » fl. Thus, we conclude that while it is 
not rigorously impossible for a spontaneous process with a negative entropy 
change to occur in an isolated system, the probability of such an occur¬ 
rence is so slight that the process can be considered impossible in actual 
practice (i.e., operationally), and this is all that thermodynamics requires. 

We turn now to a statistical-mechanical analysis of the second law of 
thermodynamics in closed, isothermal systems. The situation here is a 
little more complicated, but of more practical interest since isolated sys¬ 
tems are exceptional in experimental thermodynamics. Again we have 
to consider a spontaneous process initiated by the removal of a restraint, 
but since now the system is immersed in a heat bath, initial and final 
equilibrium states have the same temperature rather than the same 
energy (as in an isolated system). The three examples above (chemical 
reaction, diffusion, adsorption) also serve as examples here, with the 
understanding that in each case the system is put in thermal contact with 
a large constant-temperature bath. In all these cases and in general, as 
is well known, the second law of thermodynamics takes the form AA < 0 
for a finite, spontaneous process in a closed, isothermal system. It is 
this inequality that we now wish to deduce from statistical mechanics. 

A system in a heat bath has fluctuations in E, as we have seen in 
Section 2-1. For given N and V, many energy levels E have to be con¬ 
sidered. Let J2(AT, V, E) be the number of quantum states accessible to 
the system, in the initial thermodynamic state, for a particular energy 
level E. When the restraint is removed, causing the spontaneous process 
in question to occur, the number of accessible quantum states for energy E 
may increase, but cannot decrease since all the Q original states are still 
available to the system. That is, Cl'(N, V, E) > Q(AT, V, E), and this 
will be true for every E. Of course, in principle, some values of E may 
be energy eigenvalues (i.e., energy levels) in the final state but not in 
the initial state. If so, 0 = 0 and fl' > 0 for such an E. In the initial 
state, 

Q = V,E)e~ BltT , (2-26) 

E 

and in the final state, 

Q' = Y, °'(tf > v > E)e~ BlkT , (2-27) 

E 

where every possible E in the final state is included in the two sums. 
Now no term in the sum in Eq. (2-26) can be larger than the correspond¬ 
ing term (same E) in Eq. (2-27), in view of the relation 12' > $2 just 
mentioned above. Hence, since A = —kT In Q, 


Q' > Q and A A < 0. 


(2-28) 
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The above argument provides a general statistical-mechanical justifica¬ 
tion of the equilibrium criterion A = minimum, dA = 0, for a closed, 
isothermal system (i.e., the system runs through all possible spontaneous 
processes—each with A A < 0—and finally settles, at equilibrium, on 
the lowest available value of A). A frequent theoretical application of 
essentially this principle might be mentioned. Suppose that in a statis¬ 
tical-mechanical model of a thermodynamic system a continuously variable 
parameter £ enters, in addition to the independent thermodynamic vari¬ 
ables. If we are able to find, from the model, Q(N, V, T, |) and therefore 
A(N, V, T, £), then the appropriate value of £ to assign to the real system 
is that value which minimizes A (maximizes Q) for given N, V, T. An 
example would be the fraction £ of holes in a lattice model for a crystal 
or liquid, utrith N, V, T fixed. Or, £ might be the “degree of advancement” 
of a chemical reaction, or the fraction of molecules in one or the other 
phase of a two-phase system. 

It may be instructive to examine A E and A S separately for spontaneous 
isothermal processes with A A < 0. Consider first the case where A E — 
E' — E is positive. An example: initial state = a gas of hydrogen mole¬ 
cules at T; final state = an equilibrium mixture of hydrogen molecules 
and atoms at T. The spontaneous process here is the dissociation of some 
hydrogen molecules into atoms, a process assumed to have a negligible 
rate in the initial state. E' > E because the newly accessible quantum 
states (belonging to a mixture of hydrogen molecules and atoms) in the 
final thermodynamic state lie in general at higher energies than the 
original quantum states (belonging to hydrogen molecules only). The 
bond energy of the hydrogen molecule is of course primarily responsible 
for this. From Eq. (2-21), in the initial state 

S = kin G(AT, V, E), (2-29) 

and in the final state 

S' = k In O'(AT, V, E'). (2-30) 

In the example, AT is the number of hydrogen molecules started with 
(i.e., in the initial state). Now the function Q(N, V, E) always increases 
with increasing E (holding N and V constant), as we have already seen 
(Eq. 1-79). Therefore, since E' > E in this case, 

G(AT, V, E') > G(AT, V, E). 

But also, as already emphasized for any E, 

Q '{N, V, E') > G (AT, V, E'). 

O'(AT, V, E') > Q(N, V, E), 


Hence 
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Fig. 2-3. Spontaneous isothermal process with A A < 0, A E > 0, and 
AS > 0. 



Fig. 2-4. Spontaneous isothermal process with A A < 0, A E < 0, and 
AS > 0. 



Fig. 2-5. Spontaneous isothermal process with A A < 0, A E < 0, and 
AS < 0. 
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and consequently, from Eqs. (2-29) and (2-30), 


AS = S' - S > 0, (2-31) 

as is in fact obviously required if we are to have AA < 0 and AE > 0. 
This situation is summarized diagrammatically in Fig. 2-3. 

On the other hand, if AE < 0 (as would occur if the group of new 
quantum states, inaccessible in the initial thermodynamic state, lies in 
general at lower energies than the original group of states), AS may be 
positive or negative, as shown in Figs. 2-4 and 2-5 respectively. 

We might add the following quite general comment on closed, iso¬ 
thermal systems. Such a system will be thermodynamically more stable 
the lower A, the larger. Q, and therefore, in view of the form of Eq. (2-26) 
for Q (a sum of terms, all positive), the lower the energy levels E available 
to the system and the denser (Q) the quantum states, especially at low 
energies. 

Because of its importance, we should also mention explicitly the thermo¬ 
dynamic condition AF < 0 for a spontaneous process occurring in a 
closed system at constant temperature and pressure. The partition 
function in the initial state is 

A = £ Q(N, V, E)e- BlkT e- pvlkT . 

E.V 

In the final state, replace A by A' and 0 (N, V, E) by Q'(N, V, E). For 
any particular V and E, W(N, V, E) > Q{N, V, E), as before. Therefore 

A' > A and AF < 0, (2-32) 

since F = —kT In A. This is the statistical-mechanical basis for the 
equilibrium condition F = minimum, dF = 0, in a closed system at con¬ 
stant pressure and temperature. From this condition, well-known (thermo¬ 
dynamic) results follow, such as An = 0 for chemical (see Section 10-1) 
or phase equilibria, where n = chemical potential. Having shown once 
and for all the statistical-mechanical basis of the criterion F = minimum 
(and therefore Ap = 0), we shall make free use of this condition in ap¬ 
plications and not feel obliged to rederive it in special cases. 

It will be apparent to the reader that the argument leading to 
Eqs. (2-28) and (2-32) can be extended to any of the partition functions, 
Eqs. (1-83) through (1-88). The characteristic function on the right side 
of any of these equations will be maximized at equilibrium if the inde¬ 
pendent variables are held constant (e.g., —(3A is a maximum at constant 
N,V,p). 
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2-4 Third law of thermodynamics. Our object here is to present the 
statistical-mechanical basis of the third law of thermodynamics. But 
first we review very briefly the status of the third law in pure thermo¬ 
dynamics, and its relation to certain thermodynamic integration constants. 
As usual, there are some fine points that we shall omit or pass over lightly 
in this introductory discussion. The reader is referred to the Supple¬ 
mentary Reading list at the end of the chapter for fuller discussion. 

It has been found experimentally that AS —» 0 as T —» 0 for a large 
number of isothermal processes involving pure phases. Examples are: 
(a) a phase transition at temperature T between two different crystalline 
forms of the same substance (e.g., tin or silicon dioxide); (b) the phase 
transition between liquid and solid helium at T; (c) a chemical reaction 
between pure crystals at p and T (e.g., Pb + I 2 —> Pbl 2 ); and (d) a 
crystal at T and x —* crystal at T and x', where x represents one or more 
independent thermodynamic variables other than T (e.g., x = pressure, 
volume, or magnetic field strength). The experimental result, A S —* 0, 
for systems of this type is presumed to be general (with some under¬ 
standable exceptions) and is not a consequence of the first two laws of 
thermodynamics. Hence the third law of thermodynamics has been pro¬ 
posed. It can be stated as follows: for any isothermal process involving 
only pure phases in internal equilibrium (or including frozen metastable 
phases if the process does not disturb the frozen equilibrium), 


lim AS = 0. (2-33) 

r-»o 


This statement does not exclude, for example, processes in which meta¬ 
stable atomic nuclei are present but unchanged in the process, or in which 
negligibly slow chemical reactions are thermodynamically possible but do 
not occur. But it does exclude, say, the process glass crystal. Also, 
the word “pure” does not exclude isotopic mixtures if the mixing is un¬ 
affected by the process. 

The term “metastable” is closely related to the terms “accessible" and 
“inaccessible” quantum states. Metastability implies the possibility of 
another, more stable, thermodynamic state which, however, is unavail¬ 
able because of the inaccessibility of the associated quantum states (see 
Section 2-3). 

An essentially equivalent but slightly more general statement of the 
third law* is: it is impossible by any procedure, no matter how idealized, 
to reduce any thermodynamic system to the absolute zero of temperature 
in a finite number of operations. This is called “the principle of the un- 


* See Fowler and Guggenheim, pp. 224-227. 
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attainability of absolute zero.” We shall not use the third law in this 
form, or establish the connection with Eq. (2-33). 

To illustrate the use of the third law in thermodynamics, consider, as 
an example, the simple gaseous chemical reaction 

A —* B 

occurring at p and T, and meeting the metastability restrictions of 
Eq. (2-33). From experimental heat capacities (extrapolated to T — 0) 
and heats of phase transitions, we have the well-known equations 

H a (T) = Ha( 0) + h A (T), (2-34) 

Sa(T) = Sa(0) +0a(T), (2-35) 

where 

f T 

h A (T) = £ A HLn* + / Cp(T') dr, 

JO 

0 a(T) = £ A sL* + f o GtypdT’. 

Ha( 0) and Sa( 0) are integration constants, and the sums are over the 
phase transitions encountered at p between T' = 0 and T' = T. Similar 
equations can be written for B. Integration of dF A /dT — — S A gives, 
further, 

Fa(T) = H a ( 0) - TS a ( 0) - f T g A (T')dT’. (2-36) 

Jo 

The integration constant Fa( 0) is equal to H A ( 0), since the relation 
F = H — TS becomes F — H at T = 0. Then for the reaction A —*• B, 

A H(T) = H b (T) — H a (T) — Aff(0) + A h(T), (2-37) 

A S(T) = AS(0) + Ag(T), (2-38) 

AF(T) = AH(0) - T A5(0) - f Ag(T')dT'. (2-39) 

Jo 

The value of the constant H A (0) depends on the location of the zero of 
energy, and this is arbitrary for any single substance. But in a process 
such as A —* B, the choices of H A { 0) and Hb( 0) have to be self-consistent. 
That is, A H (0) is not arbitrary, but is the limit (T —* 0) of an experimental 
quantity. In general, A/f(0) 0. The third law simplifies Eqs. (2-38) 

and (2-39) considerably by asserting that AjS( 0) = 0 in these equations. 
There is no operational (experimental) justification—since thermodynamics 
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gives us information about entropy differences only—for going beyond 
this and simplifying Eqs. (2-35) and (2-36) by setting Sa( 0) = 0 as well. 
But this step is obviously convenient and is consistent with the require¬ 
ment AS(0) = 0, so it is adopted as a convention for pure phases (crystals, 
except for helium). That is, Sa(T) is set equal to the experimental (in¬ 
cluding an extrapolation) quantity Qa{T). 

With this condensed thermodynamic survey as background, we turn 
now to the statistical-mechanical side. The first point that requires 
comment is the integration constant c in Eq. (1-21), where we remarked 
that c is independent of the thermodynamic state of a closed system. This 
is a sufficiently inclusive observation for our present purposes, provided 
we understand “state” in a general way. For example, any of the iso¬ 
thermal processes (a) through (d) mentioned at the beginning of this 
section may be regarded as involving changes in the thermodynamic state 
of a closed system. In every case, one can in principle proceed reversibly 
from the initial state to the final state by infinitely slow changes in one or 
more parameters on which the state depends (volume, pressure, degree 
of advancement, etc.). Thus for all isothermal processes embraced by 
the third law of thermodynamics (Eq. 2-33), the constant c is independent 
of these parameters of state, and hence for such a process (the prime 
refers to the final state), 

AS = S' - S = (kina' + </) - (fcIn0 + c) 

= k A In Q, Ac = 0. (2-40) 

The value assigned to c clearly has no operational significance at all, for 
it always cancels on taking differences. Hence we can put c = 0 if we 
wish, as in Chapter 1. 

Having thus disposed of Ac, the remaining question is whether statistical 
mechanics predicts, as is required for agreement with Eq. (2-33), that 

lim A In Q = A In O 0 = 0 (2-41) 

r-*o 

for the processes included in our above statement of the third law. The 
subscript zero refers to the ground state, the quantum state of interest, 
since T — * 0. If we include only the degrees of freedom associated with 
the electrons and centers of mass of the nuclei (electronic, vibrational, 
rotational, translational), we can say that quantum mechanics predicts 
or experiment indicates in known cases that the ground state of a pure 
crystal or liquid helium is nondegenerate.* For these degrees of freedom, 
then, Q 0 = Qg = 1. Now, fortunately, this is a much stronger statement 


* See Fowler and Guggenheim, pp. 199-205. 
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than Eq. (2-41) requires. For one thing, as far as thermodynamic con¬ 
sequences (such as the third law) are concerned, a value of G 0 in the range 
1 Qo e N is indistinguishable from G 0 — 1» as can be seen from 
Eq. (2-40) [AS = 0(Nk)]. This is a considerable safety factor. More 
important, if we include intranuclear degrees of freedom, about which 
there is still a vast amount of ignorance, we have to expect in general 
that In Q 0 and In are not thermodynamically negligible, owing to 
degeneracy of the nuclear ground state or possibly of a metastable state. 
But the third law is saved despite this because we expect Q 0 = Go (ordinary 
thermodynamic changes do not affect the nuclear state), and hence 
A In fi 0 = 0. Similarly, if we include the entropy of isotopic mixing, 
which is present in most real thermodynamic systems, lnG 0 and In Qq 
will not be negligible, but still G 0 = Go- The same remarks can be made 
yet again concerning any type of metastability, allowed in our statement 
of the third law, which introduces nonnegligible degeneracy at 0°K. 

We come to the conclusion, then, that for the same systems which obey 
Eq. (2-33) thermodynamically, we anticipate on molecular grounds that 
Eq. (2-41) will also be obeyed. This is all that we need for a statistical- 
mechanical understanding of the third law. 

But we also conclude that In Go and In Gy separately are in general 
unknown and may not be thermodynamically negligible. This conclusion 
destroys any hope of setting up a scale of truly “absolute entropies” for 
single substances (in addition, there is the arbitrariness of the additive 
constant c). 

In spite of the above remarks, it is conventional to introduce, for pure 
substances, what might be called “practical absolute entropies.” If we 
exclude any substance which possesses at 0°K frozen metastability that 
“thaws out” between 0°K and T°K (we shall encounter some of these 
exceptions in Chapter 9), we have 

S(T) = c + kln^ + fclnQo = 5(0) + g(T), 
or 

S(T) - c - klnQo = = f(T). (2-42) 

In these equations one might just as well set c = 0. In Eq. (2-42), the 
expression on the left is the definition of the practical absolute entropy, 
the second expression is the statistical form of it, and the third is the 
thermodynamic form (Eq. 2-35). In other words: (a) c + k In Q 0 has the 
same significance as 5(0) in Eq. (2-35), and both are nonoperational 
quantities; (b) O/O 0 is the number of quantum states associated with the 
electrons and centers of mass of the nuclei (i.e., with the electronic, vibra- 
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tional, rotational, and translational degrees of freedom); and (c) the G/Q 0 
quantum states referred to in (b) are the ones that are “excited ” in ordinary 
thermodynamic systems when the system absorbs heat (DQ* = T dS) to 
increase its temperature from 0°K to T° K. 

For simplicity, in the rest of the book (except in Chapter 9) we shall 
write Eq. (2-42) simply as 

S(T) = fclnO(T) = g(T), (2-43) 

with the understanding that we are referring to the “operational” part 
of the entropy only: we count in Q only those quantum states which are 
actually “excitable” and exclude those which are not (nuclear ground 
state, isotope mixing, any metastability frozen over the whole temperature 
range). As we pointed out following Eq. (2-41), this Q = 1 in the ground 
state, and hence this S = 0 at T = 0°K. 

The entropy calculated as k In 0 in Eq. (2-43) is often called the “spec¬ 
troscopic entropy,” since energy levels from spectroscopy are used in 
practice; and the entropy measured as g (with an extrapolation of C p 
to 0°K) is called the “calorimetric entropy.” We shall compare these 
quantities in Chapter 9, and discuss some exceptions (owing to “thawed” 
metastability) to the expected equality. 
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Problems 

2-1. Note that Eqs. (1-83) through (1-85) can be written in the form 
In T"1 0«?i, Cfa, Gz)e tl ° l - <fii(gu Gz, Gz) — ^ 

Ol 

where g is intensive and G extensive. Show that 



Also, Eqs. (1-86) through (1-88) can be written 

in G((7i, Gz, G ^ 3)e ,1<?, e” <?, •* <rii(gi, 02 , Gz) “ ^ + + QaOz- 

Ol.Ol 

Show that 



(Page 34.) 

2-2. Show that the grand ensemble equations for S and p go over into the 
canonical ensemble equations for the same variables when InS is replaced by 
the logarithm of the largest term in S. (Page 39.) 

2-3. Prove the following: if the probability distribution for G is gaussian 
about <? with <ro/7) very small, then the probability distribution for 1/G about 
1/7} is gaussian with 

<ri/o £o 

1/7} “ 3' 

Show that this result and Eq. (2-14) lead to the density fluctuation for¬ 
mula (2-10). 

2-4. Show that in the N, p, T ensemble (Eq. 2-12), 

U* — (Rf - kT 2 C p . 


2-5. Show that in a two-component, open, isothermal system, 

5 ^ (Si,,, - tT (Sl,„ ■ 

2-6. Prove that in the canonical ensemble (see Problem 1-2), 

- * , '[C 4 L+"] - 
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2-7. Show that the canonical ensemble equations for p and ju go over into the 
microcanonical ensemble equations for the same variables when In Q is replaced 
by the logarithm of the largest term in Q. 

2-8. If one combines Eqs. (2-34) and (2-35) to obtain Fa{T) — Ha{T) — 
TSa(T), the resulting expression appears to differ from Eq. (2-36). Prove that 
the two equations for Fa(T) are equivalent. 

2-9. Justify the statement above Eq. (2-1) that the energy distribution is 
gaussian. 
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Fowler, R. H., Statistical Mechanics, Cambridge, 1936 (Second Edition), 
Chapter 20. 

Fowler and Guggenheim, Chapter 5. 

Greene, R. F., and Callen, H. B., Phys. Rev. 83, 1231 (1951). 

Kittel, Part 2. 

Landau and Lifshitz, Chapter 12. 

Mayer and Mayer, Chapters 3 and 4. 

SchrSdinger, Chapter 3. 

S. M., Chapters 3 and 4. 

Tolman, Chapter 14. 

Wilson, Chapter 7. 
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Part II 

Systems Composed of Independent 
Molecules or Subsystems 





CHAPTER 3 


GENERAL RELATIONS FOR INDEPENDENT 
DISTINGUISHABLE AND INDISTINGUISHABLE 
MOLECULES OR SUBSYSTEMS 

The simplest problems in statistical mechanics are those concerned with 
systems composed of molecules, groups of molecules, or degrees of freedom 
which are effectively independent of each other. Part II of this book is 
devoted to problems of this type, except for Chapter 6, which returns to 
more general considerations. A lack of such independence is usually 
associated with two causes: (a) intermolecular forces (treated primarily 
in Part III), and (b) symmetry restrictions on quantum-mechanical wave 
functions (discussed in Part IV). Of course, no statistical-mechanical 
system can consist of strictly noninteracting molecules (or subsystems), 
for then the system could not achieve internal equilibrium. By ‘‘inde¬ 
pendent” we shall imply “weak interaction” only: the molecules or degrees 
of freedom interact sufficiently to maintain thermal equilibrium by energy 
exchange, but not to such an extent as to require taking into account 
intermolecular forces, etc. The “weak interaction” may be direct (e.g., by 
collisions) or indirect (e.g., via the walls or a heat bath). The most obvious 
example is an ideal gas; the density is low enough so that intermolecular 
forces make a negligible contribution to thermodynamic properties (e.g., 
the equation of state), but equilibrium is still maintained by intermolecular 
collisions and/or by collisions with the walls. 

The “subsystems” of a system, other than molecules, referred to in the 
chapter title and above might be, for example: (a) different degrees of 
freedom in the same molecule (translation, rotation, etc.); (b) independent 
vibrational modes in a monatomic crystal; (c) molecules adsorbed on 
independent groups of adsorption sites on a solid surface; etc. 

Before turning in the next chapter to our first specific example of a 
system of independent molecules, we set down in the present chapter 
certain relationships of quite general validity for systems of this type. 
A few brief comments will be made about possible applications, but 
details in every case will be reserved for the appropriate chapter later in 
the book. 

3-1 Independent and distinguishable molecules or subsystems. Let 
H be the classical Hamiltonian function for the macroscopic system under 
consideration. If the system is composed of independent molecules or 
subsystems, by definition H will be given by a sum of independent 
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contributions, 

H = H a + H b + • • •, (3-1) 

where H a , Hb, etc., refer to individual molecules, degrees of freedom, etc. 
Similarly, for the Hamiltonian operator, 

3C = 3Ca + Xb + • • • . (3-2) 

Let the eigenvalues and eigenfunctions of 3C a , 3C t , etc., be denoted by 
««, (b, ■ • • and ta, tb, . • •, respectively. Then we try, as a solution of 
Xt = Et, the product t = ta tb • •., and find 

Kt = (3C, + K b + ■' -)Mb- ■ ■ 

— Me • • • Kata + tat a * * * K»frb H- 

= Me • • • fata + Me' ' ' Mb H- 

= («o + + ’ * ‘)t ~ Et- 

That is, the possible energy eigenvalues for the whole system are of the 
form 

E — e a + €» + •••, (3-3) 

just the sum of the separate energies t a , as we might expect. The 

essential practical point here is the following: for a system of independent 
molecules or subsystems, we do not have to solve the complete Schrddinger 
equation X t = Et with O(10 20 ) coordinates, but only the separate 
equations X a ta = Ma, etc-, each with one or relatively few coordinates. 
Thus we have a reduction of a many-body problem to a one- (or few-) body 
problem, because of the form of Eq. (3-1). 

For one molecule or subsystem at a time, let us define partition functions 
of the canonical ensemble type, 

fl« = 2 e ~* ailkT < 96 = 2 e~* h > ,kT , etc., (3-4) 
) i 

where the sums are over molecular or subsystem energy states j (an 
energy level with degeneracy is represented by several terms in the sum). 
Next we notice that the product of all the g’s generates all possible values 
of E. For example, suppose there are just two subsystems, one with three 
energy states and one with two states. Then 

9«96 = (e~* allkT + e -“» /ir + <T“* ,tr )(«-*», /tr + e- thi ' kT ) 

_ e — («al+«M)/*r _|_ g -(«al+« 62 )/tr _j_ g — 

J £*“(*a2-H&2)/W (<a3+«6l)/ir g—(<o8-H62)/fcr 
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The sums of e’s in the exponents exhaust the possible energy states E for 
the system. Thus we have in this simple example, and in general, 

Q=E e~*t' tT = (E (E —Mr-. (3-5) 

That is, the canonical ensemble partition function Q for the whole system 
is a product of separate q’a for the independent molecules or subsystems. 

In Chapter 5 we shall apply Eq. (3-5) to the small vibrations of identical 
atoms about their equilibrium positions in a monatomic crystal. The 
model adopted is restricted: each atom in the crystal is confined in its 
motion to the immediate neighborhood of a particular site (equilibrium 
position) in the crystal lattice. Of course, in real crystals the atoms can 
jump occasionally from one site to another, but this is not allowed in the 
model. Now suppose we number the sites in the crystal lattice in some 
regular order. Then these labels can be considered as belonging as well to 
the particular atoms confined to the various sites. In this sense, and for 
this model, we can regard the atoms of the system to be identical and 
distinguishable. This is an exception (owing to the idealized model) to 
the general quantum-mechanical principle that identical molecules are 
indistinguishable, i.e., that no experiment can distinguish between the 
order ab and the order ba for two identical molecules (see Section 22-1). 

Note that Eq. (3-5) has been derived on the basis that the molecules or 
subsystems are independent and distinguishable (labels a, b, ...). The 
assumption of distinguishability is obviously correct if the molecules or 
subsystems are all different. If, however, the molecules are independent 
and identical, then Eq. (3-5) is correct only if a model artificially introduces 
molecular distinguishability. In this case, the energy states for each of the 
identical molecules will be the same, and 

?« = qt =■••»?= E e ~' ilkT > 

Q = q". (3-6) 

The Einstein model of a crystal (Chapter 5) is the most obvious example, 
but not the only one. 

Incidentally, the independent subsystems might be macroscopic (and 
therefore distinguishable) parts of the complete system in thermal con¬ 
tact with each other. Then we would write Eq. (3-5) in the notation 

Q = QaQb .... (3-7) 

We have already encountered an example of this in Eq. (1-23). Equation 
(3-7) implies that A, S, E, etc., are additive; that is, 

A = A a + A b H-, etc. 
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3-2 Independent and indistinguishable molecules. In a gas of N 
identical and independent molecules, or in a simplified model of a liquid 
(Chapter 16) in which the molecules are assumed to be independent of 
each other, we are faced with the problem of handling independent and 
indistinguishable molecules. Let us return to the simple example following 
Eq. (3-4), where now a and b are identical and indistinguishable. Here, 
from the quantum-mechanical point of view, the arrangement e„i + s b2 
is operationally indistinguishable from the arrangement «a 2 + e»i. That 
is, these two arrangements belong to a single quantum state* and should 
count only once instead of twice in the sum Q — e~ E > /kT (Eq. 3-5), 
which is supposed to be over all quantum states of the system, with one 
term for each state. 

In general, a term in Q of the form 


e -(tai+»tj +—>/*r 


(3-8) 


where a, b, ... are identical and indistinguishable molecules, each in a 
different molecular quantum state (i t* j • • •) as in the example above, 
will occur Nl times if we use Eq. (3-6) for Q. This follows because the N 
states i,j ,... can be permuted in AT! ways among the N molecules a, 6,..., 
and all of these N\ ways will appear in the product q N . The reader may wish 
to verify this by forming the product q a qbQc for three identical molecules 
and three different molecular states, 1, 2, 3. If Eq. (3-6) for Q contained 
terms of this type only, i.e., with each molecule in a different molecular 
quantum state, the correction for indistinguishability would be easy. We 
would simply divide q 1 * by Nl so as not to weight each quantum state 
of the system Nl times in the sum e~ E * ,kT instead of only once. 

Actually, many terms in Eq. (3-6) are not of this sort. For example, 

g—(«qi+*K+«e/+" , )/*r^ (3—9) 


where at least two molecules are in the same molecular quantum state i. 
These terms lead to complications: (a) such states of the system are 
allowed in Bose-Einstein statistics (Chapter 22), but AT! is no longer the 
proper correction factor; and (b) in Fermi-Dirac statistics (Chapter 22), 
quantum states of the system in which two identical molecules are in the 
same molecular quantum state are not allowed at all, i.e., such states 
should not appear in the sum e~ E > lkT . 

In a quantum-statistical treatment of identical and indistinguishable 
molecules, the two possibilities, Bose-Einstein and Fermi-Dirac statistics, 
have to be discussed as separate cases. We postpone such a discussion to 
Chapter 22, because it involves more quantum mechanics than we wish to 
use in Parts I, II, and III. 


* This point is amplified considerably in Chapter 22. 
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We can make further progress at this point only if we restrict ourselves 
to an important special case, which is a limiting or asymptotic form of both 
Bose-Einstein and Fermi-Dirac statistics: the number of molecular states 
“available” (i.e., with energies between the molecular ground state and the 
ground state plus, say, 10 kT, more or less) is very large compared with N, 
the number of molecules in the system. In this case, in the product Q — q N , 
the number of terms of type (3-9) will be negligibly small [virtually all 
terms will be of type (3-8)] simply because so many choices of molecular 
states are available to the molecules that two molecules only rarely find 
themselves in the same state. The Bose-Einstein and Fermi-Dirac com¬ 
plications above then disappear. Thus, the correction Nl is now appro¬ 
priate for all terms making a significant contribution to Q, and we have 

Q = Y\ qN (3 “ 10) 

for a system of identical, indistinguishable molecules satisfying the con¬ 
dition that the number of available molecular states is much greater 
than N. We shall see in Chapter 4 that for a monatomic ideal gas (or for 
the translational quantum states of a diatomic or polyatomic gas), this 
condition is satisfied at ordinary temperatures and densities. 

The limiting form of Bose-Einstein and Fermi-Dirac statistics, repre¬ 
sented by Eq. (3-10), is called classical or Boltzmann statistics. The 
interconnections between the three types of statistics will be examined in 
Chapter 22. 

In the expression q = e~*> lkT for a molecule in a gas of independent 
molecules, the (j depend only on the volume V, besides molecular param¬ 
eters such as mass, moments of inertia, force constants, etc. We shall see 
this in detail in Chapters 4, 8, and 9. In other words, q = q(V, T), a 
function of the thermodynamic variables V and T only. It is interesting 
that this conclusion alone [i.e., without the exact form of q(V, T)], together 
with Eq. (3-10), determines the ideal gas equation of state (Problem 3-1), 
for we have 

E = £ Q(N, V, T)\ n = X) [g( y Xl * = « 8X = e pVlkT (3-H) 

N >0 N >0 

and 

(3 - 12) 

Of course, we still have to investigate more closely, in Chapter 4, the 
conditions under which Eq. (3-10), which we have used here, is valid. 
Incidentally, we cannot deduce Eq. (3-12) for an approximate model 
(mentioned at the beginning of this section) of a liquid in which the 
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molecules are assumed to be independent, because in such a model q will 
be a function not of V and T but of V/N and T (Chapter 16). 

Returning to the relation Q — q N /N\ for a one-component system, 
where q = £,• e~*> ,kT , an important simplification or approximation 
(Chapters 4,8, and 9) is that each e is itself expressible as a sum of separate 
contributions, because the Hamiltonian for one molecule is at least 
approximately separable. For example, 


H = H, + H, + H v + H „ 

€ = ft + + «» + «*> 


(3-13) 


where t = translation, r = rotation, v — vibration, and e = electronic 
In this case (Chapters 8 and 9), just as in Eq. (3-5), 


9 = 9<9r9»9« 

0 = Jfi (MrJ^/- 


(3-14) 


Equation (3-10) is easily generalized for systems with several com¬ 
ponents. For example, for a binary mixture (Problem 3-2), 


Q = 


Ni\N z l' 


(3-15) 


3-3 Energy distribution among independent molecules. Consider 
first a system of N independent, distinguishable molecules, with q = 
j e~*i ,kT . (At the end of this section we shall see that the final equations 
derived here for independent, distinguishable molecules are also valid for 
independent, indistinguishable molecules, provided that the use of Boltz¬ 
mann statistics is justified.) The question arises: What fraction of these 
molecules will be in some particular molecular quantum state, say state i 
(energy €,•)? An equivalent question is: What is the probability that one 
particular molecule, say molecule 1, will be observed, in an experiment, 
to be in state i? Or, what fraction of the time does molecule 1 (or any 
molecule) spend in state t? To answer, we use Eq. (1-28) as a starting 
point. According to this equation, the probability that molecule 1 is in 
state i, molecule 2 in state j, molecule 3 in state l, etc., is 


Pm ■ • • 


e -E ilV ..ikT c _( € . + « } . +6l+ ...)/*r 

Q = i* 


(3-16) 


since an assignment of a molecular state to each molecule specifies a 
definite quantum state of the whole system. Let jj,- be the probability that 
molecule 1 is in state i irrespective of the states of the other molecules. This 
is just the probability asked for above. Then 
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Fig. 3-1. Probability ij of a molecule having an energy t, at three different 
temperatures (schematic). 

u Y- P e-^CEi e-'> lkT )(Z, e -'* kT )... 

n. - 2^ - ~nN - 


g-V* 1- 

9 


(3-17) 


This has precisely the form of Eq. (1-28) itself, but molecular states re¬ 
place system states. Of course, jj< = 1. The probability ij< decreases 
exponentially with increasing «,■, and the fall-off with e< is more rapid at 
lower temperatures (Fig. 3-1). At very low temperatures, practically all 
molecules will be in the ground state (e t ): ?n —» 1 as T —* 0. At high 
temperatures, m —> constant (independent of e<). We shall encounter 
several numerical examples of Eq. (3-17) in later chapters. 

For simplicity of notation we shall continue in this chapter to use the 
language of molecular states. But for many purposes, it is more convenient 
to use molecular energy levels. If u is the degeneracy of the level e, then 


use ‘ 

ri (level) = wit (state) = —-— > (3-18) 

5=2 e~ t i ,kT = 2 (3-19) 

i i 

(states) (levels) 



66 


INDEPENDENT MOLECULES OR SUBSYSTEMS 


[CHAP. 3 


Subscripts have been dropped in Eq. (3-18) to avoid confusion. The 
analogy with Eqs. (1-37) and (1-38) for system energy states and levels 
is obvious. 

The average energy of a molecule is 

l = E = ^E (3-20) 

i 9 i 

Because of additivity (independent molecules), the average energy of the 

system is simply _ 

® + cy + </ + ••* = Ne. (3-21) 

If we denote the average number of molecules in state j by £y, then 

V) = Niij, E Vj = N, (3-22) 

$ 

so that 

® = E N *i*i = E (3-23) 

i i 

Fluctuations in € are of some interest. If we differentiate the equation 

3* - E ^ ,iT 

i 

with respect to temperature, we find 

«*- (?) 2 = = (3-24) 

The last relation follows from Eq. (3-21). Since Cv — O(iVfc) and e — 
0(kT), we see that [i 5 — («) 2 ] 1/2 is of the same order of magnitude as I 
itself. That is, the energy probability distribution function for a single 
molecule is broad and not sharp, in contrast to the total energy distribution 
for a system of many molecules (Eq. 2-5). The order of magnitude we 
find here for the molecular energy fluctuations is consistent with the order 
predicted by Eq. (2-5) if we put N — 1. A sharp energy probability dis¬ 
tribution is a many-molecule effect: the large relative fluctuations in the 
energies of individual molecules practically cancel each other, leaving 
only an extremely small relative fluctuation in the total energy. 

A broad probability distribution is in general not gaussian, and the shape 
of the curve is not completely characterized by the quantity e 5 — (I) 2 . 
The most familiar example of a broad distribution is the Maxwell-Boltz- 
mann distribution in velocity or translational kinetic energy for a classical 
one-component system (Chapter 6), usually chosen as an ideal monatomic 
gas. 
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At this point let us take advantage of the relative simplicity of a system 
of independent molecules to look again [see Eqs. (1-17) and (1-18)] at 
the molecular interpretation of thermodynamic work (TF) and heat (Q*). 
From Eq. (3-23), holding N constant, 

dE =Y, dVj + 2 Vj dtj = DQ* — DW. (3-25) 
i i 

Then 

DW = pdV=J2 Vi (- |Sf) dV, (3-26) 

or 

P=E^(-^)‘ (3-27) 

Each molecule makes an additive contribution to the pressure, in an 
amount depending on the change of its energy e, with volume. When 
work DW is done by the system, the (average) population of different 
energy states remains fixed, but the energies («y) themselves shift. The 
negative of the thermodynamic work, — DW, is the total work necessary 
to shift the energy levels of all the molecules in the system when the 
volume is varied by an amount dV. 

Also, from Eq. (3-25), 

DQ* — ) 1 tj dVj, (3—28) 

i 

which says that heat DQ* is absorbed by shifting the population of energy 
states, holding the energy levels fixed. Consider a special case: heat is 
absorbed by the system as a result of a temperature increase dT, with N 
and V both held constant. Then 

dE = C v dT = DQ* = T dS = £ (j ^ dT. (3-29) 
The change in population of state j is, from Eqs. (3-17) and (3-22), 

dVi = d $dT = ^ ~ 8 s) dT ■ (3-30) 

That is, if dT is positive, the population of states with energy greater than 
i increases at the expense of those states with energy less than l. For a 
state with energy larger than e by an amount of order kT, the fractional 
population increase ( dVj/Vj) is of the same order as the fractional tem¬ 
perature increase ( dT/T ). 
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If we substitute Eq. (3-30) into Eq. (3-29), we recover Eq. (3-24) for 
Cy (Problem 3-3). In the special case (T —> 0) that all molecules are 
in the ground state (j = 1), «y — ? = 0 for j — 1 and Uj = 0 for 
j > 1; hence, by Eq. (3-30), diCj/dT — 0 at T — 0 for all j and Cy — 0 
(Problem 3-3). 

If a molecule has separable or approximately separable “internal” 
degrees of freedom, as for example in Eq. (3-13), the energy distribution 
can be broken down one step further. Thus, in this example, Eq. (3-17) 
becomes 


Vijlm — 


0 — 


(3-31) 


where the molecular quantum state (ijlm) is characterized by the transla¬ 
tional state (t), rotational state (J), etc. Then the probability that, say, 
the rotational state is j, irrespective of the translational, vibrational, and 
electronic states (or the fraction of all molecules in rotational state j), is 

«?”* = E natm = (3-32) 

i.l.m q ' 

just as in passing from Eq. (3-16) to Eq. (3-17). Applications of this equa¬ 
tion will appear in Chapters 4, 8, and 9. 

Finally, we have to discuss indistinguishable molecules. So far the 
argument in this section has been concerned with the distinguishable case. 
We recall first that we are limited to situations in which the number of 
available molecular states greatly exceeds the number of molecules. In 
the notation used above, the average population (number of molecules) 
per molecular state must be much less than one, (7,- <K 1: every molecule 
in the system is in a different molecular state (with negligible exceptions), 
or, an equivalent statement is that each molecular state is either unoc¬ 
cupied or occupied by just one molecule. 

Equation (1-28) gives the probability of observing a particular quantum 
state of the system of N identical molecules. With indistinguishable 
molecules, a single quantum state of the system is specified if we say that 
one molecule (any one) is, say, in molecular state t, another (any other) 
is in state j, another in state l, etc. Which molecule is in which state is 
immaterial since the molecules cannot be distinguished from each other. 
Equation (1-28) then reads 

Piil • • •- wm — (3 ~ 33) 


Note that this probability is N\ times larger than the corresponding 
probability (distinguishable molecules) for a single quantum state of the 
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system in Eq. (3-16). This is because, in the case of indistinguishable 
molecules (when??,- « 1), the system has a total of N\ times fewer quantum 
states from which to choose. To find the probability that any one molecule 
(not one particular molecule, such as molecule 1) in the system is, say, in 
state i, we should sum P,yj... above over all values of j, l,m, _How¬ 

ever, different permutations of the values of j, l, m, ... should not be 
included in the sum, because such permutations do not generate new 
quantum states of the system. For example, if N = 4, in the sum over j, 
l, and m: j = 4, l = 6, and m = 17, say, should be included; but then 
j = 6, l = 4, and m = 17, or j =17,1= 4, and m = 6, etc., should be 
omitted. On the other hand, it is actually simpler to include all values of 
j, l,m, ... and correct for the permutations referred to above by dividing 
by (N — 1)!. With a sufficiently large number of molecular quantum 
states (Cj <K 1), we can ignore the rare terms in the sum in which j = l, 
oil = m, etc. 

In summary: if we sum P,-,j... over all values of j, l ,... and divide by 
(N — 1)1, the result is the probability that any one molecule in the 
system will be in state t. But this is just the quantity Therefore 


“ (N - 1)! Piil •' • 

3 !*»•*• 


Ne~*' liT 


r,«i. 


e -*JkT qff -1 

(N - l)\(q N /Nl) 


(3-34) 


As a check, we note that if we also sum over all values of i and divide by 
N \instead of (N — 1)1, we get 


m 




q N /Nl . 
“ q N /N\ ~ ’ 


as we should. If we define tj< by in = U./AT, as in Eq. (3-22), then 


e -*il kT 1 

Vi = —-— > v% « jj > (3-35) 

which is the same as Eq. (3-17) for distinguishable molecules. That is, 
Hi is the probability that one particular molecule is in state t. Or, more 
properly here, since a “particular” molecule is a nonoperational concept, 
V, is the fraction of all molecules in state i. This is a very small number, 
since itself is already a small fraction. 

Thus, as one would expect intuitively and as we anticipated at the 
beginning of the section, the molecular energy distribution for a given set 
of energy states tj is the same for distinguishable and indistinguishable 
independent molecules. There is the important distinction, however, that 
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Eq. (3-34) is valid for indistinguishable molecules only if <?,• « 1, whereas 
no such restriction is operative for distinguishable molecules (e.g., as 
T ->• 0, ->• N). 

The high density of molecular states necessary to satisfy 7Jj <K 1 is 
guaranteed in most applications (as we shall see in the next chapter) just 
by the translational degrees of freedom. Then if there are internal degrees 
of freedom, as for example in Eq. (3-31), i ujt» « 1 /N for a single molecu¬ 
lar state. But if we now sum nijim over all translational states i, the result 
is the fraction of molecules in a given rotational-vibrational-electronic 
state jlm —a number which is not required to be small compared with 
unity. In other words, the internal distribution of indistinguishable 
molecules among nontranslational energy states is ordinarily unrestricted, 
just as in the case of distinguishable molecules. Thus, Eq. (3-32) gives 
the distribution over rotational states; the distribution is the same for 
distinguishable and indistinguishable molecules; and there is no restric¬ 
tion on ijy ot [for example, at low temperatures, )ji ot —► 1 (Chapter 8)]. 

3-4 “Ensembles” of small, independent “systems.” In the Einstein 
model of a crystal, each of the N molecules carries out its own independent 
vibrational (and possibly also internal) motion around a definite lattice 
point, and hence is “distinguishable. ” Furthermore, the molecules are in 
thermal contact (“weak interaction”) with each other. The reader will 
recognize that this situation is rather closely analogous to that of the 
canonical ensemble of Section 1-3. There we were dealing with an en¬ 
semble, a collection of 31 independent, distinguishable macroscopic systems 
in thermal contact with each other. Here a single molecule is a “system,” 
and the crystal (system) is the “ensemble.” If we re-examine the argu¬ 
ment in Section 1-3, we see that it is still valid even when the systems of 
the ensemble are very small. In the crystal example, the system is a single 
molecule, but other cases are possible in which the system contains more 
than one but still only a relatively small number of molecules. To avoid 
confusion, let us return now, as in the first part of this chapter, to the use 
of the term “subsystem” in referring to one of these small systems. If we 
let Cj be the number of molecules (or subsystems, generally) in the mo¬ 
lecular state j with energy «/, then a set of numbers C i, 0%, ... is a “distri¬ 
bution,” and we have, in place of Eqs. (1-1) through (1-3), 


Z^ = N, 

3 

Z = E, 

3 

m = ife’ 


(3-36) 

(3-37) 


(3-38) 
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“ensembles” of small, independent “systems” 


where Q(C) is the number of quantum states of the macroscopic system 
(e.g., crystal) for a given distribution C. The macroscopic system is iso¬ 
lated (i.e., N, V, and E are constant). As in Eq. (1-12), we find that 


Vi 






N 


N ~ E,- 


(3-39) 


which is consistent with Eq. (3-17). The macroscopic system has to be 
very large (i.e., N —♦ oo) in order fully to justify the replacement of??,- 
by C*j and the use of Stirling’s approximation on In C,-!. The connection 
with thermodynamics (e.g., 0 = 1/kT) is most easily established through 
the relation S = fc In ft (C*) and the thermodynamic equation (1-78) 
(Problem 3-4). The most probable or “equilibrium” distribution C* is 
the one which makes (1(C), and therefore S, a maximum in an isolated 
system, in agreement with the second law of thermodynamics (Section 2-3). 

The temperature of a single molecule (or subsystem) is well defined. 
This is the temperature which appears in the Boltzmann energy probability 
distribution for a single molecule, Eq. (3-39) with 8 = 1/kT. But other 
functions, such as the energy, are not well defined, in the thermodynamic 
sense, for the individual molecules (or subsystems) because of large fluc¬ 
tuations about mean values (e.g., ? in Eq. (3-24): a measurement of e will 
not almost certainly yield «, as would be the case for S in a macroscopic 
system]. 

Of course, no new results are obtained by adopting the above “sub¬ 
system” point of view. Furthermore, the argument is restricted to macro¬ 
scopic systems made up of independent, distinguishable molecules or sub¬ 
systems; it is a special case of the more general argument of Section 1-3. 
However, when possible (independent molecules or subsystems), there is a 
considerable intuitive or conceptual advantage to thinking in terms of a 
system containing only one or a few molecules rather than a macroscopic 
number. 

We have seen in the preceding section that when a system of N inde¬ 
pendent, indistinguishable molecules satisfies the condition ??,-« 1, the 
energy distribution is the same as for distinguishable molecules but the 
total number of quantum states of the macroscopic system is reduced by a 
factor N\. We therefore anticipate that the (molecule, system) <-> (system, 
ensemble) analogy, argument, and results above for distinguishable mole¬ 
cules will still hold for indistinguishable molecules, provided only that 
we use (1(C)/Nl in place of ft. There is, however, a serious complication: 
since Uj — C^ « 1, we have to use Stirling’s approximation on very 
small numbers. Actually, this procedure happens to give correct results, 
but the method can hardly be considered satisfactory. The rigorous way 
to handle this problem, with the same results, is to use the Darwin-Fowler 
application of the method of steepest descents. 
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It should be emphasized that our derivation in the preceding section of 
Eq. (3-34) for Vj in a system of independent, indistinguishable molecules 
is not open to the above objection. That derivation is based, ultimately, 
on the use of a canonical ensemble of 91 distinguishable, macroscopic 
systems, with 91 —» oo (Section 1-3). 

In a number of applications it is very helpful to use an extension to the 
grand ensemble of the “subsystem” point of view brought out above for 
the canonical ensemble. Since these applications are somewhat specialized, 
we shall postpone (see, however, Problem 3-5) our discussion of this 
subject to Chapter 7 (“binding” or adsorption on sites) and Chapter 22 
(Bose-Einstein and Fermi-Dirac statistics). 


Problems 


3-1. For a gas of independent molecules, InE = q(V, T)\ = N (Eq. 3-12). 
Use Eq. (1-68), p - kT (d In E/dF)„,r, to deduce that q must have the form 
q — Vf(T), where/(T) is a function of temperature only. (Page 63.) 

3-2. Extend the derivation of Eq. (3-10), Q = q^/NI, for a one-component 
system of independent, indistinguishable molecules to the two-component case, 
Eq. (3-15). (Page 64.) 

3-3. Use Eq. (3-30) to verify that 


Cv = £ 
i 



m 2 - (e) 2 l 

kT * 


for a system of independent molecules. Note that Cr is never negative. (Page 

68 .) 

3-4. Show that substitution of the most probable distribution (3-39) into 
S = k In Q(C*) leads to 

A = —NkT In 


Also, with the aid of ( dS/dE)vj/ = l/T (Eq. 1-78), prove that /S = 1/kT. 
(Page 71.) 

3-5. Consider a one-component macroscopic system which consists of M 
independent, equivalent, and distinguishable subsystems. Here M plays the 
role that V does in most systems. Each subsystem can contain any number from 
zero up to m molecules, where m > 1. Modify the notation used in Section 1-5 
(the system here corresponds to the grand ensemble in Section 1-5) to obtain the 
equivalent of Eq. (1-45) for the probability distribution. Use S = fclnO 
(most probable distribution) to establish connections with thermodynamics. 
Note the analogy with Eqs. (1-75) through (1-77). (Page 72.) 

3-6. For a gas of independent molecules, Q = q(V, T) K /N\. From the ca¬ 
nonical ensemble equations (1-31) through (1-36) prove, as in Problem 3-1, 
that q = Vf(T). 
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3-7. Use Eq. (3-15) for a two-component system, where qi and ?2 are func¬ 
tions of V and T only, to deduce by the method of Eqs. (3-11) and (3-12) that 

*+*-Sr 

3-8. Show from Eqs. (2-4) and (3-24) that 

= N 


i 


TZ 


This emphasizes that the e probability distribution is much broader than the E 
distribution for N large. 
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CHAPTER 4 


IDEAL MONATOMIC GAS 


As our first specific example we consider a one-component monatomic 
gas dilute enough so that intermolecular forces can be ignored. At the 
outset, each atom is treated as a mass point (internal degrees of freedom 
will be discussed briefly in Section 4-4) with three translational degrees of 
freedom. This discussion will also be applicable, in Chapters 8 and 9, to 
the translational degrees of freedom of dilute diatomic and polyatomic 
gases, because of the additivity of the Hamiltonian function [as in 
Eq. (3-1)]. 


4-1 Energy levels and canonical ensemble partition function. By virtue 
of the high dilution, the molecules of the gas are independent of each other. 
They are also indistinguishable. Therefore, the canonical ensemble 
partition function is (Eq. 3-10) 

<3 - q N , (4-1) 


provided that the number of available molecular quantum states is large 
compared with N. Our first task is to investigate q. 

Let the volume V be in the shape of a cube of edge L; then V — L 3 . We 
choose a cube merely for convenience; thermodynamic properties of the 
gas do not depend on the shape of the container. In the sum q = e~*> lkT , 
the energies €y in this case are those associated with one molecule pos¬ 
sessing three translational degrees of freedom only and confined to a cubical 
box. This is a standard problem in quantum mechanics. The possible 
energy states are 




h 2 (lx -j- ly -j- l ») 

8mL2 


(4-2) 


l*> lyi It — 1, 2, 3, . . . , 


where l x , l v , and l, are the three quantum numbers. 

In the three-dimensional lJJ t space (Fig. 4-1 shows a two-dimensional 
version of this space), there is a one-to-one correspondence between pos¬ 
sible molecular quantum states and points in the space with positive inte¬ 
gers as coordinates. Therefore, there is one quantum state per unit volume 
of this space. The equation 

lx + 1% + l 2 = R 2 , 
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(4-3) 
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Fig. 4—1. 



with 


n2 8mV 2/3 e 
K = -rs- ’ 


(4-4) 


is the equation of a sphere of radius R in IJyl, space. The volume of the 
positive (l x , ly, h all positive) octant of a sphere of radius R is tR 3 /Q. 
This is also the number of quantum states 4(«) with energy less than e, 
where e and R are related by Eq. (4-4), provided that 4(e) is very large 
so that edge effects are negligible. That is, 


4(e) = 



(4-5) 


For Eq. (4-1) to be legitimate, we have to have 4 N when we put 
e = 0(kT) in Eq. (4-5), since states with e very much larger than kT will 
not be appreciably populated and are therefore effectively unavailable. 
Later it proves convenient to choose the unimportant numerical coeffi¬ 
cients, of order unity, so that this condition reads 


or 


where 



* _ ™_ 

(2wmkT) i l 2 


(4-6) 

(4-7) 


We see that the condition (4-6) is favored by low density, large mass, and 
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high temperature. This same condition is useful when applied to diatomic 
and polyatomic molecules as well, for the total density of molecular 
quantum states is practically determined by the density of the translational 
states alone (i.e., the translational energy levels are relatively very close 
together). 

Even though we are interested in dilute gases here, let us make a strin¬ 
gent test of (4-6) by using the (experimental) density of the liquid phase at 
the normal boiling point. We find the results in Table 4-1, which will be 


Table 4-1 



T, °K 

A 3 N/V 

He 

4.2 

1.5 

h 2 

20.4 

0.44 

Ne 

27.2 

0.015 

A 

87.4 

0.00054 


referred to again in Section 6-2 and Chapter 22. In the gas phase at the 
boiling point, the values of A 3 N/V would be reduced further by a factor 
of the order of 100. The conclusion we reach, then, is that, except for 
light molecules at appreciable densities and very low temperatures, the 
condition (4-6) is easily satisfied (Problem 4-1). 

It is interesting to note that A is a length of just the order of magnitude 
of the de Broglie wavelength h/mv (v = velocity) of a particle with 
kinetic energy of order kT. Since (V/N) 113 is a distance of the order of 
the average nearest-neighbor distance between molecules, Eq. (4-6) 
asserts that quantum effects (requiring the use of Bose-Einstein or Fermi- 
Dirac statistics) will be absent if neighboring molecules are usually far 
apart relative to the “thermal” de Broglie wavelength A. 

We return now to 

q = ^ e~* v v , ‘ lkT . (4—8) 

We can replace the sum by an integral if, in passing from one energy level 
to the next higher level, Ae « kT, for in this case the summand will change 
its value essentially continuously with l x , l y , and l,. From Eq. (4-2), 
M = 0(h 2 /mV 2 ' 3 ) and 

— = 0 (—— - 

kT \mkTVW , 



But we already have the restriction (4-6), which can be written 
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A 2 

V2/3 


« 


JV2/3 


0 ( 10 - u ). 


Thus the requirement A e/kT « 1 is overwhelmingly satisfied. 

To integrate Eq. (4-8) (see also Problem 4-2), we note from Eq. (4-5) 
that the number of molecular quantum states between « and « 4- de is 

«« * = f ( de = l (^) 3 'Ve 1/2 de. (4-9) 

Therefore [compare Eq. (3-19)], 

q(V, T) = o>(e)e~ tlkT de = f* u ll2 e~ u du 

_ (2irmkT\ 3l2 v _ V 

\~P - / F = Ai’ (4 ~ 10) 


where u = e/fcT. This confirms Problem 3-1. That is, q is proportional 
to V. We see that q is dimensionless and has a very simple expression in 
terms of the thermal de Broglie wavelength A. 

The fraction of molecules with energies between e and e + de is [compare 
Eq. (3-35)] 


P(e) de = 


u(e)e~ ,lkT de 

9 


(4-11) 


This is essentially the well-known Maxwell-Boltzmann distribution 
(Section 6-4), since to « e 1/2 . 

We have for the canonical ensemble partition function 


n _ 1* _ 1_(VY 
Q ~ W! 9 “ATI \AV ’ 

InQ = -WinW + N + Win q = WIn[(—^]- 


(4-12) 

(4-13) 


We can now derive all the thermodynamic properties of the gas from this 
equation. 


4-2 Thermodynamic functions. First, the Helmholtz free energy A 
is simply 

A(N, V, T) = -kTlnQ = -NkTln[(^~y' 2 —]• (4-14) 

We can confirm that A is an extensive property. That is, A/N is seen to 
be a function of the intensive variables V/N and T only. We should 
emphasize that in this equation and others below: (a) the zero of energy 
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for each molecule is implicit in Eq. (4-2) (i.e., the zero is at the bottom of 
the potential well or box of volume V —classically, this corresponds to a 
particle at rest); and (b) the entropy <S = k In $1 is based on the number 
of translational quantum states of the system of N molecules (i.e., the 
zero of entropy is associated with 0=1, although this situation is un¬ 
realizable for a dilute classical gas). 

The pressure is given by 



It was pointed out, following Eq. (1-27), that A: is a universal constant 
whose value can be obtained from any convenient experimental system. 
We see from Eq. (4-15) that 


lim 

AT/V-.0 


pV 

NT 


for a real gas of known molecular weight will give k. The value found is 
k = 1.38044 X 10 -16 erg-deg _1 . 

The energy is 

E = kT 2 N = NkT 2 dl Pjy 3/ - = | NkT. (4-16) 

This is all kinetic energy, kT /2 per degree of freedom per molecule. The 
potential energy is zero because of the absence of appreciable intermolecular 
forces and by virtue of our choice of the zero of energy. Also, 

- (ff)™ “ §“• (4 - 17) 


as is found experimentally. From Eqs. (4-15) and (4-17), 


pV = j \E. (4-18) 

Equations (4-15) through (4-18) are often called “classical” results 
because they can be derived directly from classical statistical mechanics 
without any use of quantum theory (Chapter 6). The use of Boltzmann 
statistics (Eq. 4-1) and the replacement of a sum over quantum states by 
an integral, as in Eq. (4-10), lead to classical results from a quantum- 
mechanical starting point. This statistical-mechanical correspondence 
principle is explored in much more detail in Chapters 6 and 22. 

Equation (4-18) is interesting because it is obtained, for a dilute mona¬ 
tomic gas, not only from Boltzmann statistics but also from Bose-Einstein 
and Fermi-Dirac statistics (Chapter 22). We can anticipate this result by 
an argument based on the energy-level expression (4-2) alone, without 
use of Eq. (4-1) (Boltzmann statistics). We have to assume that [see 
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Eqs. (3-21) and (3-27)], for a dilute enough gas in a volume V, p and E 
are proportional to N. That is, each molecule makes an independent or 
additive average contribution to p and E. For a molecule in state j 
(j = l x , l y , l t ), the energy, according to Eq. (4-2), has the form = 
<tj/V 2,z , and the contribution of this molecule to the pressure is [compare 
Eq. (1-7)] 


_ _ 2 v _e /3 _ 2 e> 

dV~ Z 1 ~ 3 V 


Now we average over all states j, multiply by N, and find 


p = N 



_ZNl_2E_ 
“3 V ~3 V 


9 


(4-19) 


which is the same as Eq. (4-18). 

The entropy follows from Eqs. (4-14) and (4-16) for A and E: 


S = 



= Nk In 



(4-20) 


From (4-6), we see that the quantity in brackets is necessarily large com¬ 
pared with unity, say 10 s or more, but the logarithm of the quantity will 
be of order unity. Hence S = 0(Nk), as expected from thermodynamics. 
In terms of the pressure instead of the number density N/V, Eq. (4-20) 
becomes 




(4-21) 


Equation (4-21) is in agreement, as it should be, with the well-known 
thermodynamic expression for the entropy increase in an isothermal 
expansion of an ideal gas, 

AS = S(p 2 , T) - S(p lt T) = Nkln^l- 

P 2 

For the chemical potential and Gibbs free energy, we have 


F 



—kT In 


= -kTl n 



= —kT In 


^2 wmkTy 12 fcTj 


(4-22) 

(4-23) 

(4-24) 
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Equation (4-23) also follows from F = Np = A + pV and Eqs. (4-14) 
and (4-15) for A and pV (Problem 4-3). 

In thermodynamics one finds that, for an ideal gas, 

/t(p, T) = p°(T) + kT\np, (4-25) 

where p°(T), often called a “standard chemical potential* or “standard 
free energy,* is an integration constant. For a monatomic gas, Eq. (4r-24) 
provides a statistical-mechanical evaluation of this integration constant 
(based on the zeros of energy and entropy already referred to): 

p°(T) = -kTln^(~~y i2 kT^- (4-26) 


It should be noted that the quantity in brackets here has dimensions of 
pressure, and must be in the same units (e.g., atm, mm Hg, erg-cc -1 , etc.) 
as p in Eq. (4-25). Clearly the numerical values of the separate terms on 
the right of Eq. (4-25) depend on the choice of the pressure unit, but 
their sum gives a p(p, T) which is independent of this arbitrary choice. 
In several places in this book it will prove convenient to introduce standard 
chemical potentials p°(T). The above point about pressure units must 
always be kept in mind when a p°(T) is encountered. Of course, in com¬ 
puting any truly operational quantity, the combination in Eq. (4-25), 
or pe‘ fi,kT , will occur, and the matter will then automatically take care of 
itself [e.g., there is no complication with Eq. (4-24) where the two terms 
in Eq. (4-25) are kept together]. 

This completes our summary of the basic thermodynamic functions of 
an ideal monatomic gas. Other functions (e.g., the heat content H) 
follow from those already given. From this first example, we can see the 
power of the statistical-mechanical method: it provides an explicit and 
detailed theoretical prediction of all the thermodynamic properties of the 
system. We have to reserve comparisons of equations such as (4-21) and 
(4-24) with experiment (entropy, crystal vapor pressure, chemical equi¬ 
librium constants, etc.) until we have discussed some other systems (see 
Chapters 5, 9, and 10). 

For a binary mixture of dilute monatomic gases, according to Eq. (3-15), 


where 


Q = 


v, v, 
gl g2 

NiW 2 !’ 



and 



(4-27) 

(4-28) 


In (4-28), Ai and A 2 differ only through m 1 and m 2 . From 
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In Q = -Nt In N 1 + N 1 + N l In 9l (F, T) 

N 2 In N 2 + N 2 + N 2 In q 2 (V, T) (4-29) 

and Eqs. (1-31) through (1-36), we find easily (Problem 4-4) 

pV = (Ni + N 2 )kT, (4-30) 

E=Wi+N 2 )kT, (4-31) 


etc. The familiar thermodynamic formula for the entropy of mixing two 
Ideal gases follows from Eq. (4-32) (Problem 4-5). 


4-3 Grand ensemble and others. It is instructive to try out partition 
functions other than the canonical ensemble partition function on this 
simple system. We have in fact already derived the equation of state 
from the grand partition function in Eq. (3-12). We should note that 
Eq. (3-12) also gives 


N 

9 


T7 A 8 


= X = d flkT , 


(4-33) 


which is a slightly disguised form of Eq. (4-23) for p. 

As another example, let us use the partition function A (N, p, T) of 
Eqs. (1-87), (1-91), and (2-12): 

- ^ = In A = In J Q Q(N, V, De~ pr/kT d (g) • (4-34) 


Here V is continuously variable, so we use an integration over V instead 
of a sum. The choice of the dimensionless variable of integration is some¬ 
what arbitrary,* but pV/kT is particularly convenient for this purpose. 
If we let x = pV/kT, 


Then 


a= w\Q£)" L * Ne * dx 

= (j^Y' 

\p\ 3 / 


(4-35) 


F = Np — —kT In A = —NkT In 



(4-36) 


* See S. M., pp. 63 and 68. 
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which is the same as Eq. (4-24). Also, 


S = - 




in agreement with Eq. (4-21), and 


V = 


(") 

\dp/ T,N 


NkT 

V 


(4-37) 


(4-38) 


We shall not work out the details of the microcanonical ensemble ab 
initio* But we can easily find Q(1V, V, E) from the canonical ensemble 
equation (4-20) and the relations S = k In 0 and kT — 2E/3N: 


Q(2V, V,E) 



(4-39) 


This is a tremendously large number, of order e N or 10 10 ’°. As a partial 
check (Problem 4-6), we note that [using Eq. (1-79)] 


j_ / aino N m 

kT \ BE ) 2 E 


(4-40) 


4-4 Internal degrees of freedom. So far in this chapter we have treated 
monatomic molecules as mass points with three translational degrees of 
freedom, using the energy states of a particle of mass m in a cubical box of 
volume V. Of course, an atom is not a mass point but has electronic and 
nuclear substructure. Associated with this substructure are electronic and 
nuclear energy states, deducible in principle from quantum mechanics 
but in practice, in most cases, from spectroscopy. We wish to show briefly 
here how these “internal degrees of freedom” influence the thermodynamic 
functions of the system. 

The first comment to make is that the translational Hamiltonian is 
rigorously separable from the electronic and nuclear Hamiltonians. 
Second, the electronic and nuclear Hamiltonians are also separable, to a 
very high accuracy. Hence the basic equation is (3-14): 

Q = jJj(9«M,) W . (4-4D 

We have already discussed q f . We now turn to a consideration of the 
nuclear and electronic energy levels, and hence of q n and q„ 

Because of the extremely large separation between the two lowest 
nuclear energy levels [A«» = 0(1 Mev); or Ae n = O(kT) when T = 


* See Mayer and Mayer, pp. 109-117. 
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O(10 10o K)], at ordinary temperatures the atom is certain to be in the 
nuclear ground state. This follows from Eq. (3-32). We shall denote the 
degeneracy of the nuclear ground state by <o„i. The situation is similar, 
but not so extreme, for the electronic energy levels. The separation As, 
between the two lowest levels is usually of the order of 1 ev; Ac, = 0(kT) 
when T = 0(10,000°K). Therefore again, according to Eq. (3-32), at 
ordinary temperatures (say T < 1000°K) the atom is almost certain to 
be in the electronic ground state. However, since there are exceptions to 
this, we mention a few particular cases to give some idea of the range in 
behavior. 

In the inert gases, He, Ne, A, etc., As, is large, of order 10-20 ev. Hence 
only the ground state is important. The degeneracy to,i of the electronic 
ground state ( l S 0 ) is unity for these atoms.* 

For the alkali metal atoms, As, = 0(1.5 ev). Here again excited elec¬ 
tronic states can be ignored at ordinary temperatures. In this case 
<o,i = 2 CS 1 / 2 ) owing to the unpaired electron spin. Similarly, for the 
hydrogen atom, As, = 10.2 ev and < 0,1 = 2. 

In the case of the halogen atoms F, Cl, Br, and I, As, = 0.050, 0.11, 
0.46, and 0.94 ev, respectively. The degeneracies of the first two levels 
(the third level has a significantly higher energy and can be ignored here) 
are 4 (^ 3 / 2 ) and 2 ( 2 Pi/ 2 ), respectively. Therefore the fraction of halo¬ 
gen atoms in the first excited level is (Eq. 3-32) 

= «, *r' a,kT _ 2e~ A< ‘ /tr , . 

1,42 Utie-UiikT + to, 2 e-«.a/*r 4 + ’ 11 

As, = S,2 — S,j . 

At 1000°K, we find (Problem 4-7) i», 2 = 0.22, 0.12,0.0024, and 9 X 10~ 6 
for F, Cl, Br, and I, respectively. Certainly for F and Cl, then, and pos¬ 
sibly Br, we have to retain two terms in the electronic partition function. 

We choose as zero of energy here an atom in its ground electronic and 
nuclear states and at the bottom (classically, at rest) of the translational 
potential box or well (Fig. 4-2). Then q, (Eq. 4-10) is unchanged and 

9» = «m, s„i = 0, (4-43) 

q e (T) = < 0,1 + «, 2 e — e,i = 0, As, = s, 2 . (4-44) 

The most common case (the inert gases) is q, = »,i = 1. All thermo¬ 
dynamic functions now follow in a routine way from Eqs. (4-41), (4-43), 
and (4-44). 

* For spectroscopic notation and facts, see G. Herzberg, Atomic Spectra and 
Atomic Structure. New York: Dover, 1944. 
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.— ■ - «e2 + «nl + «(1 

'Nea + «*1 

- 1 + «nl + «8 
■ tel + <nl + «1 

'Net + «nl = 0 (««t = <nl = 0) 


Fig. 4-2. Arbitrary location of energy zero. 


For example, the equation of state (pV = NkT) is unchanged, but 

= | NkT + (4-45) 

= Ni t + m t . (4-46) 

Similar equations can be derived for CV, S, etc. (Problem 4-8.) 

In the special case q n = co nl and q, — o>,\, the nuclear and electronic 
degrees of freedom make no contribution to p, E, Cr> etc., but they do 
contribute to all functions involving the entropy: S, A, F, ft, etc. Thus, 

In Q = JV In ( F ^ nl ) « (4-47) 

5 = Nk In = St + kin (to <lWnl )^, (4-48) 

M= - fcr in(l^l). (4-49) 

However, in accordance with the convention established with Eq. (2-43), 
we shall usually omit (Chapter 22 is an exception) the nuclear degeneracy 
u n x from all thermodynamic functions. Because of cancellation, it does 
not ordinarily contribute to measurable thermodynamic quantities 
(entropy change with temperature, equilibrium constants, vapor pressure, 
etc.). We do not extend this convention to u e i, however, because when 
> 1 in the gas phase, the stable state at Q°K (e.g., Cl 2 crystal, H 2 
crystal, sodium metal, etc.) is nondegenerate.* Hence in many cases, 


* See Fowler and Guggenheim, pp. 199-205. 
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unlike «»i, u ei does not cancel. Also, as we have pointed out, excited 
electronic states occasionally have to be taken into account. 


Problems 

4-1. Calculate the temperature at which A 3 N/V =» 0.0001 for He gas at 
1 atm pressure (use the equation of state pV = NkT). (Page 76.) 

4-2. Derive Eq. (4-10) for q by substituting Eq. (4-2) into Eq. (4-8), and 
replacing the sum by a triple integral over l x , l„, and l,. (Page 77.) 

4-3. Obtain Eq. (4-23) for p from Eqs. (4-14) and (4-15) for A and pV, 
and Np = A + pV. (Page 80.) 

4-4. Verify the fact that Eqs. (4-30) through (4-32) for a binary gas mixture 
follow from Eq. (4-29). (Page 81.) 

4-5. Derive an equation for AS in the following process: initial state = 
(a) Ni molecules of ideal gas 1 at T in a container of volume Vi, plus (b) N 2 
molecules of ideal gas 2 at T in a separate container of volume V 2 == N 2 V 1 /N 1 ; 
and final state — Ni molecules of 1 and N 2 of 2 in a single container of volume 
Fi + V 2 , at T. This is usually called the entropy of mixing, AS m ix. (Page 81.) 

4-6. Confirm formulas already obtained for p and p (ideal monatomic gas) 
by using the microcanonical ensemble equations (1-80), (1-81), and (4-39). 
(Page 82.) 

4-7. Verify the values of ij «2 calculated from Eq. (4-42) for F, Cl, Br, and 
I at 1000°K. (Page 83.) 

4-8. Derive an equation for Cy, the electronic contribution to Cr, from 
Eq. (4-45). Verify that Cy —♦OasT—►OorT—►<», and passes through a 
maximum in between. Derive an expression for S,. Check that S, —* k In < 0 $ 
as T —* 0, and S, —» k In ( 0,1 + u, 2 ) N as T —* <*>. (Page 84.) 

4-9. Use Problem 3-3 to show that the asymptotic behavior of Cy as T —> » 
is Cy = constant/T 2 . 

4-10. For argon gas (assumed ideal) at 1 atm pressure and 25°C, calculate 
A, E, and p in cal-mole -1 and Cr and S in cal-mole -1 -deg -1 (see Table 9-2). 
Take 01,1 — 1 and omit w„i. 

4-11. For a one-component ideal gas, show that the maximum term (a) in 
S occurs at N* — gX, and (b) in A occurs at V* — NkT/p. 

4-12. Show that the result of Problems 3-1 and 3-6, q = Vf(T), implies 
that the number $(«) of molecular quantum states with energy less than t is 
proportional to V. 


Supplementary Reading 

Fowler and Guggenheim, Chapter 3. 

Mayer and Mayer, Chapters 5 and 6. 

Tolman, Chapter 14. 
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CHAPTER 5 
MONATOMIC CRYSTALS 

In this chapter we investigate the thermodynamic properties of mona¬ 
tomic crystals, especially the heat capacity Cy. The molecules in a crystal 
vibrate around equilibrium positions which are arranged in a regular 
lattice. Intermolecular forces, together with the pressure and tempera¬ 
ture, determine the lattice structure and the spacing and the nature of the 
motion of the molecules in the neighborhood of their equilibrium positions. 
Offhand then, it would appear that a treatment of this system would 
belong more properly in Part III (which is concerned with systems of 
dependent molecules). But it turns out that despite the importance of 
intermolecular forces in a crystal, so long as the molecular vibrations are 
small, the system can be decomposed mathematically into independent 
subsystems (normal modes of vibration). 

In the first section we treat a monatomic crystal in a very approximate 
manner, using the model of Einstein. This model has the advantage of 
conceptual and mathematical simplicity and leads to results that are 
qualitatively (but not quantitatively) correct. The remainder of the chap¬ 
ter is then devoted to more accurate approaches to the problem, with 
emphasis on the Debye approximation. 

5-1 Einstein model of a monatomic crystaL We begin with some 
comments that establish the point of view adopted in the Einstein model. 
This discussion will also be useful later, since it pertains as well to certain 
approximate models (“cell theories”) of the liquid state (Chapter 16). 

The type of lattice structure is assumed given. Each molecule in the 
crystal is surrounded by a group of first (nearest) “neighbors,” a more 
distant group of second neighbors, etc. The central molecule vibrates 
in the vicinity of its equilibrium or lattice point in a force field which is 
the sum of the separate forces exerted by all the neighbors on the central 
molecule. In the case of inert gas molecules, for example, these inter¬ 
molecular forces are of the van der Waals (dispersion) type with a rather 
short range (see Appendix IV and Fig. 5-1), so that usually only the first 
and second neighbors make an appreciable contribution to the total force. 
The potential energy of a central molecule in this force field has a minimum 
at the equilibrium or lattice point, by definition. The potential energy 
increases in all directions as the central molecule departs from its equilib¬ 
rium position and becomes very large, owing to van der Waals repulsion, 
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Fw. 5-1. One-dimensional crystal with nearest-neighbor distance r*. Curves 
relate to potential energy of B, as a function of position, when other molecules 
(A, C ,...) are fixed at equilibrium (lattice) points. Curve a is B-A potential 
energy, and curve c is B-C potential energy. The dashed curve is the sum of a 
and c, i.e., the total potential energy of B if second-neighbor interactions are 
neglected. 


when a first neighbor is approached (Fig. 5-1). Thus each molecule is 
confined to a “cell” or “cage" bounded by its first neighbors. 

The nearest-neighbor distance in the lattice (at low pressures) will cor¬ 
respond approximately to that distance (r*) between a pair of molecules 
which leads to a minimum in the intermolecular potential energy (Figs. 
IV-1 and 5-1). The correspondence will not be exact, however, because 
of second-neighbor, and higher, effects and also because of thermal expan¬ 
sion when T > 0°K. 

Each molecule has three degrees of freedom—translational degrees which 
have degenerated into vibrational degrees because of the cage of nearest 
neighbors. The vibrations about equilibrium positions can be treated as 
small vibrations if the temperature is not too high. We limit ourselves to 
this case. Of course, as the temperature is increased, the vibrations will 
become more violent, lattice imperfections and molecular migrations will 
become more frequent, and eventually the crystal will melt. 

We assume for the sake of simplicity that each molecule vibrates in its 
own cell independently of the vibrations of its neighbors. Actually the 
motions of neighbors are coupled to each other (Section 5-2), but we 
ignore this complication here. To compute the potential field in which a 
given molecule moves, we can assume, say, that all other molecules in the 
crystal are frozen in their equilibrium positions. This potential will be 
not quite spherically symmetrical. A feature of the Einstein model is to 
insist on spherical symmetry as a further slight approximation. In other 
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words, the vibrational motion of a central molecule is assumed isotropic 
about the equilibrium point. 

Even though this approach is approximate, it provides a quite straight¬ 
forward connection between intermolecular forces and the potential, 
referred to above, which plays a crucial role in the model. In the canonical 
ensemble, the exact geometry (numbers and distances of neighbors) of a 
cell or cage is fixed by the molecular volume V/N (or number density 
N/V) and the assumed type of crystal lattice. 

Let <p(r) be the potential of the central molecule (Fig. 5-1 shows a 
simplified one-dimensional version), where r is the distance from the center 
of the cell (equilibrium point). The zero of energy is infinitely separated 
molecules at rest. We expand <p(r) about r = 0, and discard cubic and 
higher terms: 

<f>(r) = <c( 0 ) + ifr 2 -I- 

or 

*(*, V, *) - *(0) = if(x 2 + y 2 + Z 2 ) + • • •, (5-1) 


where x, y, and z are cartesian coordinates with the center of the cell as 
origin, and the force constant / = ( d 2 <p/dr 2 ) r=m0 • The linear term is 
missing because dtp/dr = 0 at r = 0. The restriction to small vibrations 
enters here when we drop higher terms in the expansion of <p. The classical 
motion of the central molecule of mass m in the potential field (Eq. 5-1) is 
that of a three-dimensional isotropic harmonic oscillator with frequency 


v = 



(5-2) 


The curvature of <p at r = 0 (that is, /) is clearly a function of V/N; 
therefore v is also a function of V/N. Of course the same is true of <p(0). 

In this model, by assumption, the total Hamiltonian for the system of 
N molecules is the sum of N independent and equivalent Hamiltonians, 
one for each molecule. Furthermore, for each molecule, we have by 
Eq. (5-1) that the x-, y-, and z-motions are independent and equivalent. 
Altogether, then, the system decomposes into an aggregation of 3 N 
independent one-dimensional harmonic oscillators, all of classical fre¬ 
quency v. These are the “subsystems” in the present model. The partition 
function for the system is 

Q(N, V, T) = e-^rio-.vim/stT q (7/W) ( 5 - 3 ) 


where q is the partition function of a one-dimensional harmonic oscillator 
of classical frequency v, with zero of energy at the bottom of the parabolic 
potential well [as in Eq. (5-1)]. The factor e ~ N ^o)/ 2 kT ^ gq ( 5 - 3 ) 
would be the partition function of a hypothetical system with all molecules 
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at rest at their lattice points; N<p( 0)/2 is the potential energy (a negative 
quantity) of such a system relative to a zero at infinite separation of all 
molecules. The factor of two is necessary to avoid counting each inter- 
molecular interaction twice. In the complete Q in Eq. (5-3) the zero of 
energy corresponds to all molecules infinitely separated and at rest. The 
electronic ground state of the atoms of the crystal is assumed to be non¬ 
degenerate in Eq. (5-3). 

Next, we have to find q. The energy levels of a one-dimensional harmonic 
oscillator are nondegencrate, and given by 

«»=(« + i)hv, n = 0, 1, 2, • • •, (5-1) 


where v is the classical frequency. Then 


-E 


_ —hv/2kT 


E ( e ~ hy,kT ^ 


»-o 


e -hPl2kT e hp/2kT 

1 _ 0 —hpfkT g hvfkT — 1 


e -QI2T 

= 1 — e -eiT' 


(5-5) 


(5-6) 


where 0 = hv/k and we have used the fact that e~ MkT < 1. The 
parameter 0 has dimensions of temperature and is called the “character¬ 
istic temperature.” It is a function of V/N. In a typical case © is of the 
order of magnitude of 300°K and v of order 6 X 10' 2 sec -1 . This fre¬ 
quency is about ten times smaller than the internal vibrational frequency 
iu a typical diatomic molecule (Chapter 8), primarily because the forces 
between molecules in a monatomic crystal are in general weak compared 
with chemical bond forces. 

At high temperatures, when T » 0, 


1 - (0/2T) + • • • T kT 

q "*■ 1 - [1 - (0/T) + •••]“* 0 hp ' 


(5-7) 


The same result is obtained if we integrate instead of sum in Eq. (5-5). 
This is legitimate if T » 0. Then 

q = e - hPl2kT f X (e - *"* 1 ')’* dn 



1 \ = kT 

In e-AW * T ) hv 


At low temperatures, q —» e hPl2kT (each oscillator is in its ground state). 
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We should emphasize that throughout this chapter, when we speak 
of the asymptotic behavior of various properties of a crystal at high tem¬ 
peratures we do not mean to imply that the temperature can be increased 
indefinitely. For in this case complications, not included in our model, 
would enter. For example, cubic and higher terms in the potential energy 
can no longer be neglected; crystal imperfections and the possibility of 
melting must be considered; etc. 

We now derive expressions for A, E, Cv, and S from Q in Eq. (5-3). 
We have 

A = -kT In Q = M). _ 3NkT ln ^ (5-8) 

and 

,5-9, 


The second term on the right in Eq. (5-9) is the zero-point vibrational 
energy, and the third term is the vibrational energy in excess of this 
minimum. At low temperatures, 


E 


and at high temperatures, 


Nx>( 0) , 3 Nhv 
~ 2 ~ + ^ 2 ~’ 


(5-10) 


E -* 3NkT. 


(5-11) 


In Eq. (5-10) E is the negative of the heat of sublimation of an Einstein 
crystal at 0°K (since E is zero for the infinitely dilute equilibrium vapor 
phase at 0°K). 

Only the last term in Eq. (5-9) contributes to the heat capacity: 

Cr “ (f L - 3m (?)’ <^T)i ■ 

We see that Cv/Nk is a universal function of T/Q, according to this 
model. This function is shown in Fig. 5-2. In general 0 is different for 
different crystals and also varies with V/N for the same crystal. But if 
the temperature scales are properly compressed or expanded, all experi¬ 
mental Cv/Nk curves, at least for monatomic crystals, should coincide. 
This is an example of a law of “corresponding states”: for two crystals 
with ©i and © 2 , if the first crystal is at T u then the “corresponding tem¬ 
perature” T 2 for the second crystal is T 2 = T i©2/©i; the two crystals 
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T/B or T/Bd 

Fia. 5-2. Heat capacity of monatomic crystals according to the theories of 
Einstein and Debye. 

have the same value of Cr/Nk at “corresponding temperatures'’ T\ and TV 
Actually, it is found that experimental Cy/Nk curves for monatomic crys¬ 
tals can be made practically to coincide by adjusting the temperature 
scales in this way, but the common experimental heat capacity curve thus 
obtained does not fit the Einstein function exactly, though it does approx¬ 
imately. We shall return to this point in Section 5-3 in connection with 
Debye’s theory. 

The limiting forms of Eq. (5-12) are 

Cy —» 3JVfc as T -► oo, (5-13) 

C v - 3Nk e -8/r as T —► 0. (5-14) 

The high-temperature limit agrees with experiment (“law of Dulong and 
Petit”), but Cy approaches zero much more rapidly in Eq. (5-14) than 
is the case experimentally. Experimentally, Cy « T 3 as T —* 0 (see 
Section 5-3). 

From Eqs. (5-8) and (5-9) for A and E we find 

S = f ~ i “ 3Nk ~ ,n C 1 - ■ (5-15) 

As T —» 0, /S —» 0. That is, D —» 1. Here Q refers to the total number of 
quantum states of a system of N independent and distinguishable three- 
dimensional isotropic harmonic oscillators (or 31V one-dimensional oscil¬ 
lators). As T —» 0, the system sinks into its lowest energy state (with 
Q=l): each molecule is in its vibrational ground state [n = 0 in 
Eq. (5-4) for ail 31V vibrational degrees of freedom]. 

We can easily set down a formal expression (Problem 5-1) for the 
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pressure p by differentiating In Q with respect to volume, as usual. Spe¬ 
cifically, we recall that both <p(0) and 0 are functions of V/N. But we do 
not pursue this matter further, since we have not written out here explicit 
expressions for #>(0) and 0. We might comment, however, that the 
Lennard-Jones and Devonshire theory of the equation of state of a liquid, 
presented in Chapter 16, is an example of this kind of calculation. 

Actually, for a condensed phase such as a crystal, pV is in general small 
compared to A, E, etc., so that we can use the approximations F = 
Np = A, H z= E, etc. In view of the crude nature of the Einstein model 
itself, the dropping of pV cannot be considered serious. 

The chemical potential p (Problem 5-1) also involves derivatives of 
#>(0) and 0 with respect to V/N. However, an approximate p follows 
immediately from Np = A and Eq. (5-8): 

^rf- 8trh ( iC'r )- (S- 16 ) 

Finally, we make a few comments on the application of the grand parti¬ 
tion function to an Einstein crystal. In the first place, the sum 3 = 
Q(N, V, T)\ N , where Q is given by Eq. (5-3), cannot be carried out 
explicitly in general because of the dependence of ^(0) and 0 on N (F 
constant). So there is no advantage in using 3 instead of Q. Secondly, a 
simplified version of the Einstein model is often used in which the crystal 
is assumed to be incompressible. Here it would appear that S might be 
useful, but there are complications which again lead to the conclusion that 
Q is the partition function of choice. In an incompressible crystal with a 
given type of lattice, the cell geometry (nearest-neighbor distance, etc.) 
is regarded as preassigned and fixed. Since <t>(0) and 0 should now be 
considered constants, the sum Q(N, V, T)\ N becomes easy. However, 
F is no longer an independent variable: F = Nv, where v, the volume 
per molecule, is constant. That is, F is simply proportional to N. The 
basic thermodynamic equation for A becomes 

dA = -SdT + pdN, 
and A = Np. We still have 

A{N,T ) = —kT In Q(N, T), 
but 

In 7 ')A a = In £ [«"*° ),2iT q(T)*\f 

N N 

— function of p and T = In [e“‘ 4(Af *’ r)/ * 7 y tf *' ,/i7 ’] 

= 0 , 
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just as in Eq. (1-89) (where there is one additional independent thermo¬ 
dynamic variable). Thus, because of the loss of the volume as an inde¬ 
pendent variable, the grand partition function requires special instead of 
routine handling.* One finds without difficulty, by methods* which we 
shall not discuss here, 

I = e ~ rml2kT q(T) 3 . (5-17) 

This result also follows, but much more simply, from 

Np — A = -kT In Q (5-18) 

or 

'•--“'(tv s ),- (5 - 19 > 

Equation (5-17) is the same as Eq. (5-16). Thus dropping the pV term 
leads to the same p as assuming incompressibility. 

5-2 General treatment of molecular vibrations in a monatomic crystal. 
Let us begin this section by indicating how, in principle, one would handle 
the problem of small vibrations in a crystal exactly. The nature of the 
problem is clearly exhibited even by a one-dimensional crystal, so we 
consider this relatively simple case (Fig. 5-3). For concreteness, suppose 
only first- and second-neighbor interactions are significant. Let u(r) be 
the intermolecular pair potential for any two atoms of the crystal (Figs. 
IV-1 and 5-1). Let there be N atoms (N is very large, so end effects are 
unimportant) in a length L, and a = L/N. 

o~- 2 —~ o © •—-© O *0 © 

t - 3 i - 2 t— I « i + 1 i + 2 ( + 3 

Fra. 5-3. One-dimensional crystal. Dots represent lattice or equilibrium 
points; circles represent atoms. 

If all the atoms are at their lattice points, the nearest-neighbor separa¬ 
tion is a. In general, let be the position of atom i with its own lattice 
point chosen as origin. The distance between atom i and atom i + 1 is 
a + x, +1 — Xi, etc. The total potential energy of the crystal is then 
(neglecting end effects) 

U{x) = ^ [u(a + x,- +1 — x-i) + «(2a + x i+2 — *<)]. (5-20) 

»— 1 


* S. M., Chapters 2 and 3. 
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With all atoms at lattice points, each x,- = 0, and 

17(0) = N[u(a) + «(2o)]. (5-21) 

We might digress briefly to indicate the relation between Eq. (5-20) 
and the Einstein model of the previous section. First, U( 0) corresponds 
to Nip( 0)/2. Second, the equation of motion of molecule j, according to 
Eq. (5-20), is 

dU 

m$j = — ^ = —u'(a + Xj — Xj-i) — «'(2 a + xy — x,_ 2 ) 

+M'(o + Xj+i — Xj) + «'(2o + Xj +2 — Xj). 

To “uncouple” the motion of molecule j from that of molecules j — 2, 
j — 1, j + 1, and j + 2, as is required by the Einstein model (each mole¬ 
cule vibrates independently), we can assume as an approximation that 
these neighboring molecules are all fixed at their lattice points: xy_ 2 = 0, 
etc. Then the equation of motion becomes 

m&j = —u'(a + Xj) — w'(2o + Xj) 

+u'(a — Xj) + u'(2a — xj) 

_ d<p{xj) 

~ dXj ’ 

where 

<p(xj) = it(a + Xj) + «(2a + xj) + u(a — Xj) + u(2a — Xj). 

This potential for molecule) corresponds to <p(r) in Eq. (5-1). Also, 

<p( 0) = 2[w(a) + w(2a)], 
as already indicated above. 

Returning now to Eq. (5-20), we can expand quantities of the form 
tt(o + 5) and «(2a + ij), which appear in Eq. (5-20), in powers of 8 and ij: 

u(a + S) = u(a) + u'(a) 8 + £w"(o) 8 2 + • • •, 

u(2a + ij) = u(2a) + w'(2a)ij + $u"(2a)ij 2 + • • •. 

If we substitute expansions of this kind (valid for small vibrations) for 
m(o + Xi — Xj—i), etc., in Eq. (5-20), we find that 

U(x) — V(0) + ^ (tenns of type x 2 , x,x,_i, x<x,_ 2 , x,Xj +u x»a: <+2 ) 


+ ••• . 


(5-22) 
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Here U(0)/N and the coefficients of the quadratic terms are functions 
of a only. The linear terms cancel because U{x) has a minimum at 
Xi = 0 (all i). 

The kinetic energy is simply mx*/2, a sum of independent terms, 
one for each coordinate a;,-. The total energy H(x, x) is the sum of U(x) 
and the kinetic energy. Unfortunately, because of the cross terms 
(x,Xi-.i, etc.) in Eq. (5-22), H — (7(0) is not separable, as in Eq. (3-1) 
[(7(0) itself is not important in the present connection—it has to do 
only with locating the zero of energy]. Hence, as matters stand, we do 
not have a system of independent subsystems as defined in Chapter 3. 

The situation is saved, however, at least in principle, by a mathematical 
theorem which states that a linear transformation to a set of new co¬ 
ordinates, 

fi = «uzi + 0112X2 + • • • + ainxif, 

£2 = OC 21 X 1 + <*22*2 + • • • + OC2NXN, (5-23) 

(if = OtfflXi + an 2^2 + • • • + CtNNXN, 


can always be found* such that the energy in the new coordinates, H((, |), 
retains the “diagonalized” form (i.e., no cross terms) of the kinetic energy 
and also has a diagonalized potential energy U((). That is, H((, £) con 
be written 

H(Z, () = U( 0) + MJl (5-24) 

i i 

where the /,• are “effective” force constants, functions of the coefficients 
of the quadratic terms in Eq. (5-22) [which in turn depend on u"(a) 
and u"(2a)], and the Mi are “effective” masses, functions of to. The 
function H — (7(0) is now separable. For each of the new coordinates 
(called normal coordinated), there is an independent contribution, 

Hi = i fig + Will (5-25) 

to H — (7(0). Hence, as in Section 3-1, the Schrodinger equation for 
the system is separable into N independent Schrodinger equations, each 
derived from an expression of the form (5-25). The Hamiltonian function 
Hi is in fact that of a harmonic oscillator and leads to nondegenerate 
energy levels 

(in — (n + i)hvi, n = 0,1, 2,..., 


* See L. Paulino and E. B. Wilson, Jr., Introduction to Quantum Mechanics. 
New York: McGraw-Hill, 1935, pp. 282-290. 
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where 

Each frequency Vi is a function of the thermodynamic variable a = L/N. 

Since this is a system of distinguishable molecules (each molecule 
restricted to the neighborhood of a labeled lattice point), no factor Nl 
appears in Q. We have, as in Eq. (3-5), 

Q = c U(0)lkT JJ q(e , )f (5-27) 

i—1 

where 0< = hvi/k (for each v,) and g(0,) is the harmonic oscillator parti¬ 
tion function, Eq. (5-6), for characteristic temperature 0,. With the 
factor e ~ u(0)lkT included in Q, the zero of energy corresponds to infinitely 
separated atoms at rest. 

The conclusion we reach (and it is the same for three-dimensional as 
for one-dimensional crystals) is that, despite the importance of inter- 
molecular forces in a monatomic crystal, small vibrations in the crystal 
can be decomposed rigorously into independent normal modes of vibration. 
Hence we are dealing here with a system of independent subsystems. This 
essentially solves the problem in principle [see Eq. (5-27)], but in practice 
we still have to find the N frequencies v„ a very difficult mathematical 
problem in most cases. 

The normal coordinate problem also arises in studying the internal 
vibrations of polyatomic molecules (Chapter 9). In this case chemical 
bonds hold the atoms together, so the vibrational force constants are 
generally larger than in a crystal. Also, the number of atoms in a molecule 
is relatively small, so “edge effects” cannot be neglected, as in a macro¬ 
scopic crystal. In principle, of course, there is no distinction between a 
crystal and a polyatomic molecule. That is, a crystal may be regarded as 
a giant molecule, usually with relatively weak bonds. If there are N atoms 
in a monatomic crystal, there are 3 N degrees of freedom altogether, of 
which three are associated with the translational motion of the whole 
crystal and three more are concerned with the rotation of the crystal. 
There are then 3 N — 6 vibrational degrees of freedom. But with 
N — 0(I0 2 °), we can take this number of vibrational degrees to be 3 N, 
without noticeable error. 

To illustrate the basic principles involved in normal coordinate analysis, 
in Appendix V we work out the problem in detail for an example of a 
hypothetical one-dimensional triatomic molecule. 

Let us bypass the normal coordinate question temporarily and continue 
a little further with the formal analysis, assuming the normal frequencies 
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Vi arc known. In three dimensions, we have 

3 w 

Q = e ~ N * (0)l2kT ]J «(©.), (5-28) 

i— 1 

where there are now 3 N normal modes of vibration, $(©,) is given by 
Eq. (5-6), and we have defined <p(0) as 2U(0)/N. The quantity <p(0) has 
the same physical significance as in the preceding section: <p(0) is the 
potential energy of interaction between one particular molecule and the 
other molecules in the crystal when all molecules are at their lattice points. 
It should be understood that Eq. (5-28) is not restricted to any specific 
model, such as Eq. (5-20), for example. Rather, a normal coordinate 
analysis of the small vibrations in the crystal is assumed to have been 
carried out whatever the nature of the forces between the molecules. From 
a thermodynamic point of view, <fi(0) and each ©,- are functions of V/N. 

Because of the very large number, 3 N, of frequencies Vi, it is convenient 
and legitimate to introduce a continuous frequency distribution g(v) such 
that g{v) dv is the number of normal modes with frequencies between v 
and v + dv. Then, from Eqs. (5-6) and (5-28), 




where 


/' 


g(y) dv = 3 N. 


(5-29) 

(5-30) 


Thus we no longer need to know all the separate v/s; the function g(v) 
is sufficient. In Eq. (5-29), <p( 0) and g(v) are functions of V/N; a more 
explicit notation would be g(v; V/N). 



(a) 0>) 


Fig. 5-4. Frequency distribution g(v) for crystal, (a) Einstein approxima¬ 
tion. (b) Debye approximation. 
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From Q above, one can now derive equations for thermodynamic func¬ 
tions such as E, Cv, P, P, etc., which are formally exact but not very useful 
unless g(i>; V/N) is known, which is not usually the case. For example, 
we find for the heat capacity, 


Cv = 



(hv/kT) 2 e k " kT g(v) 

( e h*lkT — 1)2 


dv > 


(5-31) 


which is just the sum, as we should expect, of ZN terms of the Einstein 
type (Eq. 5-12). 

In present notation, it is clear that Einstein’s rather crude approxima¬ 
tion (Section 5-1) amounts to taking g(y) as a Dirac {-function (Fig. 
5-4a). The discrepancy between Einstein’s g(v) and more accurate fre¬ 
quency distributions will become apparent in Sections 5-3 and 5-4. 

Attempts that have been made to calculate g(v) exactly, from various 
models concerning the intermolecular forces, will be summarized briefly 
in Section 5-4. However, because of the mathematical difficulty of the 
problem, this rigorous approach is not very fruitful in practice. Therefore 
we shall first consider the very successful approximation of Debye, which 
is a compromise between Einstein’s extremely simple model and an exact 
treatment. 


5-3 The Debye approximation. In classical mechanics, when a single 
normal mode of vibration of a crystal is excited, the value of the asso¬ 
ciated normal coordinate varies periodically with time. As is pointed out 
in Appendix V, in such a normal mode each atom in the crystal vibrates 
about its equilibrium position with the same period (or frequency) and 
phase as that of the normal coordinate. Figure 5-5 shows two possible 
normal modes for a one-dimensional crystal, one with short “wavelength” 
(X = 2a) and one with relatively long wavelength (X = 10a). In normal 
modes with wavelengths very large compared with the lattice spacing, 
the atomic or discrete nature of the actual crystal is washed out and 
becomes an unimportant feature. The atoms are so close to each other 
relative to the length of the wave that the wave “sees” the crystal essen¬ 
tially as a continuum. Normal modes with long wavelengths may there¬ 
fore be considered elastic waves in a virtual continuum. 


(a) X <■ 2a: o-*- -*o 


o* -*o o» -*o »•• 


(b) X = 10a: o o» o-*- o-*- o* o <o ■«-o «-o «o o 

Fig. 5-5. Two normal vibrational modes for a one-dimensional crystal. 
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This is the origin of Debye’s approximation. The above remarks about 
a continuum are exact for sufficiently long wavelengths (or low frequencies, 
see Appendix VI). Thus the correct asymptotic form of g(v) for small v 
can be deduced by treating the crystal as an elastic continuum. The 
approximation of Debye is to assume that this low-frequency form of g{y) 
is correct for all frequencies. 

It is shown in Appendix VI that in a three-dimensional continuum 
g(v) = av 2 , where a is given by Eq. (VI-30). This is, then, the explicit 
low-frequency form of g(v) that Debye adopts for all frequencies. The 
total number of normal modes is 3 N, so a cut-off at a maximum frequency 
v m is necessary (Fig. 5—4b). That is, 


g(v) dv = 3N = ~> 
o(y) = (o < v < v m ) 

= 0 (v > v m ), 


(5-32) 


as in Eq. (VI-34). From a thermodynamic point of view, v m in Eq. (5-32) 
is a function of V/N. With the complete frequency distribution assigned 
in Eq. (5-32), thermodynamic functions follow in a routine way from 
Eq. (5-29). 

Before turning to the thermodynamic functions, let us digress briefly 
to make a comment on the range of validity of Debye’s assumption 
g(v) = av 2 , which we know to be accurate at sufficiently low frequencies. 
The question is, what do we mean by sufficiently low frequencies? 
Normal modes with wavelengths X » (V/N) 113 , where (V/N) 113 is of 
the order of the lattice spacing, are treated accurately by a continuum 
theory; say, rather arbitrarily, X > 10(V/V) 1/3 . If we use X = v%/v 
and = 0[(N/V) 1,3 v 3 ] from Eqs. (VI-22) and (VI-33), we have the 
condition v < v m /10 for the range of validity of g(v) = av 2 . See also 
Figs. 5-8 and 5-10, below, where the exact g(v) can be compared with the 
Debye g(v). 

For the energy, according to Debye, we have 


E = 



N<fi(0) , 9 NkT r ( hv , 
~ 2 + v 3 Jo \2kT + 


hv/kT \ 

e h,lkT _ 1/ 


v 2 dv 


(5-33) 


N<y( 0) , 
2 " r 


9NkT [ U (x 
us Jo \2 ' 



(5-34) 
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x ~ kT’ u ~ kT 


(5-35) 


The resemblance between Eq. (5-33) and Eq. (5-9) for the Einstein model 
should be noted. One of the integrals in Eq. (5-34) is easy, but the other 
has to be evaluated numerically: 


E = 0) + 9 Nhpn + 3NkTD ( u ) t 


where 


D(u) = DQivJkT) 


_ 3_ f x 3 dx 
— u 8 Jo e* — 1 


(5-36) 


(5-37) 


In Eq. (5-36), <p(0), v m , and u are functions of V/N. The asymptotic 
behavior of D(u) is: 

D{U) -*^L^I = ^ as T -* 0 and u —> co, (5-38) 


u* Jo (1+ * + •••)- 1 


—» 1 as T —► oo and u —» 0. 


Therefore 


„ . N<p(0) , 9 Nhv m , 3Nit*hv m ( kT V 
E ~2~~ + 8 + “ 5 — \toJ 


(5-39) 


as T —► 0. (5-40) 


The second term on the right is the vibrational zero-point energy (Problem 
5-2). Also, 


E -► + 3 NkT as T -» 00 . 


(5-41) 


This is the same as Eq. (5-11) (Einstein model), and is the energy predicted 
by classical statistical mechanics (Chapter 6 ). 

For the heat capacity, we have (Problem 5-3) 


- mjW) - 


(5-42) 



5-3] 


THE DEBYE APPROXIMATION 


101 



which is often used. We call 8 d the “Debye temperature. ” It refers to 
the cut-off frequency v m . In the Einstein theory 0 refers to the only 
frequency v. 0j> is different for different crystals and is a function of V/N 
for any given crystal. According to Eq. (5-42), Cv/Nk is a universal func¬ 
tion of T/Qd. This function is shown in Fig. 5-2. Thus the Debye ap¬ 
proximation, like Einstein’s, leads to a law of corresponding states. The 
asymptotic behavior of CV is: 


12Nbr‘ 

5 




L __Y 

V 1 + U +-1/ 


(5-45) 


That is, the Debye theory predicts CV <* T 3 at low temperatures (the 
criterion is approximately T < 0^/12) and leads to the Dulong-Petit 
value of Cv at high temperatures. 

Monatomic crystals do, in fact, follow a law of corresponding states 
rather closely. Figure 5-6 shows a superposition of heat-capacity points 
for Al, Cu, Pb, and C (diamond) after suitable adjustment of the tempera¬ 
ture scale, taken from a paper by Lewis and Gibson. Many other sub¬ 



log (4.027yez>) 

Fig. 5-6. Superimposed experimental heat capacity points for Al, Cu, Pb, 
and C (diamond). 
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Table 5-1 

Low-Temperature Heat Capacities 
(Cr in ca]-mo]e _1 'deg _1 ) 


T, °K 

10 2 C{/ 3 

T 

r,°K 

,, 10 2 C{/ 3 
’ T 

32.0 

1.67 

19.1 

2.12 

33.1 

1.70 

23.6 

2.03 

35.2 

1.77 

27.2 

2.01 

38.1 

1.73 

32.4 

1.95 

42.0 

1.64 

33.5 

2.00 

46.9 

1.71 

35.1 

1.97 


stances, for example Ag, Hg, Tl, and Zn, also have curves that coincide 
with those included in the figure. Furthermore, the Debye function (5-42) 
fits the experimental curve, over the whole temperature range, in a very 
satisfactory manner. 

The T 3 law for Cv at low temperatures is also followed quite accurately, 
as seen in two typical cases in Table 5-1. According to Eq. (5-44), C]f 3 /T 
should be a constant; we see from the table that it very nearly is. We 
should expect the Debye theory to be essentially exact for the heat ca¬ 
pacity (and entropy) at low temperatures, because only the low-frequency 
modes are excited at low temperatures, and the Debye assumption 
g(v) = av 2 is correct for low frequencies (Problem 5-4). However, the 
vibrational zero-point energy term 9Nhv m /8 in Eq. (5-40) for E at low 
temperatures is not correct, because the Debye g(v) contributes to this 
quantity from v = 0 to v — v m . 

If we press the Debye theory a little harder by using more sensitive 
tests, discrepancies begin to show up, as they should since the theory is, 
after all, an approximate one. For example, Table 5-2 contains values 
of So obtained from different sources for the same substances. Ideally, 


Table 5-2 
0c Values (°K) 


Source of &d 

C 

Fe 

A1 

Cu 

Ag 

Entire experimental Cr curve 

1860 

453 

398 

315 

215 

T 3 part of Cv curve 

2230 

455 

385 

321 

— 

Experimental elastic constants 
and Eqs. (VI-15, 16, 33) 

— 

— 

402 

332 

214 
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T,°K 

Fig. 5-7. 6 d for silver as function of temperature. 


values of 6p should be independent of source. A more rigorous test is 
shown in Fig. 5-7. For each experimental value of Cv for silver, is 
calculated from Eq. (5-42), using the temperature of the Cv measure¬ 
ment. If the Debye theory were exact, points in Fig. 5-7 would lie near 
a horizontal line (0 d = constant, independent of T). Similar non¬ 
horizontal 0 b — T curves are found quite generally for other substances 
as well. 

Table 5-3 (from Blackman*) presents average 0 b values for a few more 
substances. From Fig. 5-2 we see that the classical Dulong-Petit value 
of Cv is practically reached at room temperature for many of these crystals. 
As mentioned in Section 5-1, a value of 0/» = 300°K corresponds to 


Table 5-3 
0B Values (°K) 


Element 

ei> 

Element 

Sd 

Element 

©D 

Li 

430 

Cr 

405 

Hg 

90 

Na 

160 

Ca 

230 

Be 


K 

99 

Mo 

375 

Mg 

330 

Au 

185 

Pt 

225 

Zn 

240 

Pb 

86 

W 

315 

Cd 

165 


* M. Blackman, llandbuch der Physik (Springer, Berlin) 7.1, 325 (1955). 
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v m as 6 X 10 12 sec _1 , which is small compared with typical diatomic 
molecule vibration frequencies. 

From Eqs. (5-29) and (5-34), we find for the entropy, 

S = ^ = ^/ o - Ml - e-*)]x 2 d*. (5-46) 

At low temperatures [put u = ac in the upper limit, as in Eq. (5-38)], 


S 


9 Nk , 7[A 
\15 ' 45/ 


4 ir*Nk( fV 
5 \@£»/ ' 


(5-47) 


Thus S —► 0 as T —> 0. In the nondegenerate (Q = 1, S — 0) ground 
state of the entire system (crystal), each vibrational mode is in its non¬ 
degenerate ground state. In writing Eq. (5-29), nuclear degeneracy is 
omitted as usual, and the electronic ground state has been assumed 
nondegenerate. 

Finally, let us consider the vapor pressure p 0 (T) of a monatomic crystal 
at very low temperatures. As T —► 0, po -* 0. Hence, for the crystal, 
we can drop the p 0 V term in p = (A + p 0 V)/N, and we can assume that 
the gas phase is ideal. Then, for the crystal, from Eqs. (5-40) and (5-47), 


where 



E - TS 
N 


—A 0 — 


Tr*kT 4 
5©i> ’ 



9 hv m 
8 


(5-48) 


(5-49) 


The quantity A 0 is the heat of sublimation per molecule at 0°K. Equa¬ 
tion (5-48) may be regarded as exact provided we use the experimental 
A 0 , since the zero-point energy term in Eq. (5-49) is not correct, as already 
mentioned. We may calculate 0p (Section 5-4) in Eq. (5-48), or it may 
be obtained from experimental elastic-constant or heat-capacity measure¬ 
ments at low temperature and low pressure. 

To obtain the vapor pressure p 0 , we use the thermodynamic criterion 
for phase equilibrium: we set /Crystal from Eq. (5-48) equal to p ga > from 
Eq. (4-24). We find 

tap. - far - ft - £(£)’ + (5-60) 


We should emphasize here that the same zeros of energy (isolated gas 
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molecule at rest) and entropy (Q — 1 for a system of point masses, i.e., 
the crystal, at T = 0) have been chosen for both phases. If a term in the 
ground-state nuclear degeneracy had been included in Mory»t»i and in /t ga s, 
these terms would cancel in Eq. (5-50). The electronic ground state has 
been assumed nondegenerate in both crystal and gas. 

In the equilibrium between a crystal and its vapor, as in all equilibria, 
competition and compromise between energy and entropy effects determine 
the equilibrium point. Here, the crystal is more stable from an energetic 
point of view (A 0 ); i.e., it has a lower energy than the gas because of 
intermolecular interactions in the crystal. But the gas is more stable 
insofar as the entropy is concerned; i.e., the gas molecules are in a much 
higher state of disorder and have a higher Q and entropy. The molecules 
distribute themselves between the two phases in such a way as to equalize 
the chemical potentials in the two phases: the chemical potentials depend 
on both energy and entropy. An equivalent statement is that the mole¬ 
cules distribute themselves between the two phases so as to minimize 
the Helmholtz free energy A of the combined system crystal + gas if 
V, N, and T for the combined system are constant (or F is minimized if 
p, N, and T are constant) (see Sections 10-1 and 10-2). 

The last term in Eq. (5-50) is a constant, call it t (the “vapor pressure 
constant”), the only term in the equation independent of temperature. 
Now i, which in thermodynamics appears as an integration constant on 
integrating the Clausius-Clapeyron equation for d In po/dT, can be evalu¬ 
ated experimentally by using data on the vapor pressure, heat of sublima¬ 
tion, and heat capacities of gas and crystal. Experimental values of i are 
found to agree* with theoretical values of i to within experimental error. 
Historically, this was a very significant check of the methods of quantum- 
statistical mechanics. Although i can be obtained as an experimental 
thermodynamic quantity, thermodynamics by itself is powerless, of course, 
to predict or explain i-values. Also, i cannot be deduced from purely 
classical-mechanical considerations (since h is included in i). 

5-4 Exact treatments of the frequency distribution problem. A detailed 
discussion of attempts to compute g(v) from first principles would be far 
outside the scope of this introductory book. However, we shall give here 
a brief summary, which the reader may find instructive, of exact, or 
practically exact, results obtained in certain special cases. Other sources 
must be consulted for details and derivations. We shall not discuss thermo¬ 
dynamic functions, since g{v) is available for just a few idealized models 
(Problem 5-5). 


* See Fowler and Guggenheim, pp. 199-202. 
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Fra. 5-8. Frequency distributions for one-dimensional crystal with nearest- 
neighbor interactions only. 

The first case we consider is a one-dimensional lattice with lattice 
spacing a and nearest-neighbor interactions only. The Hamiltonian is 

H = 1/(0) + l £ mil + { J (*<+i - *.) 2 - (5-51) 

This is a special case of the Hamiltonian associated with Eq. (5-22) (i.e., 
second-neighbor interactions are omitted). Specifically, the force constant 
/ in Eq. (5-51) is just u"{a) in the earlier notation (Problem 5-6). The 
frequency distribution problem here is rather easy and can be solved 
exactly. The result* is 

2AT 1 

gOO = — ( „2 _ y2)1/2 ’ (5-52) 

where 

The function g{v) is shown in Fig. 5-8. At low frequencies, 

g(")(5-54) 

71 Vm 

In the continuum theory (Appendix VI), which must agree with 
Eq. (5-54) at low frequencies, according to Eq. (VI-11) there are n 


* Mayer and Mayer, pp. 246-248. Sec also Blackman, loc. cit., pp. 330-331. 
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modes between v = 0 and v = vin/2Na. That is, 



Equating g in Eqs. (5-54) and (5-55), we find 
Vi = 7 rav m = aVJJm. 


(5-55) 


(5-56) 


This equation expresses the macroscopic quantity »i (“velocity of sound”) 
in terms of the molecular parameters a, f, and m. In the Debye approxima¬ 
tion for a one-dimensional crystal, g is a constant out to a cut-off frequency 
that we denote here, to avoid confusion, by p®. We have 


or 


,4-if.SSSs. 



(5-57) 


This is larger than the true v m by a factor 1.57 (Fig. 5-8). 

Next, suppose we have a linear crystal with lattic spacing a, force 
constant / (nearest-neighbor interactions only), but with two alternating 



Fiq. 5-9. Exact frequency distribution for one-dimensional crystal with 
mi/m 2 ** 3. 



108 


MONATOMIC CRYSTALS 


[CHAP. 5 


kinds of atoms with masses mi and m 2 . This problem can also be handled 
exactly without difficulty.* We shall not give any details, but merely 
show in Fig. 5-9 a plot of g(v) against v for mi/m 2 = 3. There are two 
branches in g{v). The low-frequency branch is called the “acoustical” 
branch, and the high-frequency branch the “optical” branch. In normal 
modes belonging to the acoustical branch, neighboring atoms are dis¬ 
placed in the same direction (as in Fig. 5-5b). In the optical branch, 
neighboring atoms are displaced in opposite directions (as in Fig. 5-5a). 
A gap between branches, as in Fig. 5-9, is common in one-dimensional 
crystals with two kinds of atoms, but not invariable. 

In a very elegant paper, Montrollf was able to find the exact g(v) for 
a particular two-dimensional crystal. The lattice studied was simple 
square, with one kind of atom, and lattice spacing a. First- and second- 
neighbor interactions were taken into account. The parameters of the 
problem are a, the mass m, the nearest-neighbor force constant / = u"(a) 
(called a by Montroll), and second-neighbor force constant 27 = «"(2 l/2 o). 
It is convenient to introduce 

T = 1 + (//2T) ’ (5_58) 

When r = 0, second-neighbor interactions are absent. An exact analytical 
expression for g(v) was found by Mon troll in the special case r = 1/3, 
or//27 = 2. This value of t corresponds to elastic isotropy. The function 
g(v) is shown in Fig. 5-10. The low-frequency behavior of g{v) agrees 
with continuum theory, g v. 

g(v) -* as v -» 0, (5-59) 

where the maximum frequency v m is given by 

On comparing Eqs. (VI-35) and (5-59), we find (Problem 5-7) 

*- 3 & (M1) 

If we calculate v% as in Eq. (5-57), we find here v%/v m = (jt/2) 1/2 = 1.25 
(Problem 5-8), as indicated in Fig. 5-10. 


* Blackman, loc. cit., pp. 331-333. 

fJ. Chem. Phys. 15, 575 (1947). Sec also M. Smollett, Proc. Phys. Soc. 
London 65A, 109 (1952). 
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Fig. 5-10. Frequency distributions for special case (r = $) of two-dimen¬ 
sional square crystal. 



Fig. 5-11. Frequency distribution for special case (t = 3/40) of a simple 
cubic lattice. 


The most accurate work on a three-dimensional case has been done by 
Newell.* Newell studied a one-component, simple-cubic lattice with 
lattice spacing a and first- and second-neighbor interactions. Again the 
force constants are / = it"(a) and 27 = w"(2 ,/2 a). We define 


1 

T 2 + (//27)' 


(5-62) 


Although Newell did not obtain an exact g(v) for any r, his calculations 
of g(v) are accurate to one or two percent for r up to almost 1/10. Further¬ 
more, he was able to describe the exact location and type of singularities 
in g(v) for 0 < r < 1/10. Figure 5-11 shows g(v) for r = 3/40. There 
are five singularities in the slope of g(v), but g(v) itself is finite and con- 


* G. F. Newell, J. Chem. Phys. 21, 1877 (1953). 
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tinuous everywhere. The low-frequency behavior of g(v) is g{v) « v 2 , 
as expected. 

Finally, we mention that Van Hove* was able to come to the following 
definite and quite general qualitative conclusions about g(v) for two- or 
three-dimensional crystals, by a topological argument. In two dimen¬ 
sions, g{v) has at least one logarithmic infinity in each frequency branch 
and at least a finite discontinuity occurring at v = v m . Montroll’s results 
are in agreement with this statement. In three dimensions, g(v) is con¬ 
tinuous everywhere, but dg/dv has at least two infinite discontinuities, 
and dg/dv = — oo at v = v m . 


Problems 

5-1. Derive equations for p and p for the Einstein model, considering <p(0) 
and 9 functions of V/N. Show that (dA/dN)v,r and ( A + pV)/N give the 
same expression for p. (Page 91.) 

5-2. Show that 9Nhv m /8 is the zero-point energy for the frequency distribu¬ 
tion (5-32). (Page 100.) 

5-3. Work out the details of the derivation of Eq. (5-42) for the Debye Cv. 
(Page 100.) 

5-4. Show that, in the Debye theory, the number of excited [i.e., n > 0 
in Eq. (5-4)] vibrational modes in the frequency range v to v + dv, at tempera¬ 
ture T, is proportional to x 2 e~ x , where x = hv/kT. The maximum in this 
function occurs at a frequency v' = 2 kT/h) hence / —► 0 as T -* 0. (Page 102.) 

5-5. If an exact calculation .of g{v) could be made for a number of different 
monatomic crystals, would a law of corresponding states be expected for Cv? 
(Page 105.) 

5-6. Show that the force constant / in Eq. (5-51) is equal to u"(a). (Page 
106.) 

5-7. Verify Eq. (5-61) for V 2 . (Page 108.) 

5-8. Show that v%/v m = (*■/2) 1/2 in Montroll’s two-dimensional case. 
(Page 108.) 

5-9. Consider a one-dimensional lattice with lattice spacing r*. All atoms 
are at equilibrium points except the “central” 'atom, which moves in the po¬ 
tential field of its two nearest neighbors. This is a one-dimensional Einstein 
model. Take Eq. (IV—1) for the intermolecular pair potential. Find <p( 0) and 
/ in the one-dimensional form of Eq. (5-1) in terms of r* and t. 

5-10. Consider a system of one-dimensional oscillators, all with characteristic 
temperature 9 [Eqs. (5-4) and (5-6)]. Derive an expression for the fraction P» 
of these oscillators in the energy level e n . Calculate Po, Pi, and P2 for T = 49 t 
T = 9, and T = 9/4. 


* L. Van Hove, Phys. Rev. 89, 1189 (1953). 
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5-11. Derive an equation for the vapor pressure po(T) of an Einstein crystal, 
assuming the vapor is an ideal gas and using Eq. (5-16). What choice of V/N 
in <p(0 ) and 8 is appropriate? 

5-12. The heat capacity Cv of a monatomic solid at 300°K is 2 B per mole. 
Use the Einstein theory to calculate the frequency v. 

5-13. Use thermodynamic connections between (a) E and Cv and (b) S and 
Cr to check the self-consistency of Eqs. (5-40), (5-44), and (5-47). 

5-14. Calculate 0j> for Fe from the value of Cy at 32.0°K given in Table 5-1. 

5-15. Use Fig. 5-2 and the 0j> values in Table 5-3 to estimate Cr in 
cal-mole -1 "deg -1 for Pb, Mg, and Be at 25°C. 


Supplementary Reading 

Blackman, M., Handbuch der Physik (Springer, Berlin) 7.1, 325 (1955). 
Fowler and Guggenheim, Chapter 4. 

Mayer and Mayer, Chapter 11. 

Slater, Chapters 13, 14, 15. 

Tolman, Chapter 14. 
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CHAPTER 6 

CLASSICAL STATISTICAL MECHANICS 

It is well known that quantum-mechanical results for a given system 
go over asymptotically into classical mechanical equations in the limit 
of large quantum numbers. Also, in the quantum-mechanical canonical 
ensemble partition function (for one or many particles), terms correspond¬ 
ing to the higher quantum numbers make more and more important 
contributions to the sum as the temperature increases. We may anticipate, 
then, that at sufficiently high temperatures, the quantum partition func¬ 
tion should approach asymptotically a partition function developed from 
a classical mechanical starting point. 

We consider classical statistical mechanics in this chapter. No results 
are obtainable from classical statistics which cannot be found as limiting 
laws from quantum statistics, but often the classical method is easier to 
use. We shall, however, refrain from discussing the principles of classical 
statistical mechanics in any detail. Although this is a very elegant subject, 
we shall adopt the point of view here—for lack of space—that the quantum 
method provides the more general postulatory foundation and hence that 
classical statistics does not require separate development since it follows 
as a special case from the quantum postulates. The reader interested in 
the principles of classical statistical mechanics per se cannot do better 
than to read Tolman’s masterly exposition. 

The treatment of the transition from quantum to classical statistics in 
this chapter will be inductive, and proofs will be omitted for the more 
complicated cases. This subject will, however, be discussed in a more 
general way in Chapter 22. 

6-1 Introductory examples. We consider two special cases in this 
section. The first is a one-dimensional harmonic oscillator, with mass m 
and classical frequency v, restricted to the 2 -axis (equilibrium point x — 0). 
The particular question of interest is, what is the classical analog of the 
quantum-mechanical equation (5-5), 

q= e ~' nlkT ? (6-1) 

n-0 

This is a sum of e -(energy)/i7 ’ over all possible quantum states of the system. 
The corresponding classical expression is the sum (or actually integral, 
since the classical state can vary continuously) of e -<energy)/ * 2 ’ over all 
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possible classical states of the system. The classical energy is just the 
Hamiltonian function H(q, p), which for this system is 

H{q, p) = ifx 2 + inti 2 

= 2t r 2 mpV+^> (6-2) 

where / is the force constant [see Eq. (5-2)], q = x, and p = mx (Ap¬ 
pendix VII). We use q and p as independent variables for H because the 
equations of classical statistics which we shall use below turn out to be 
simpler with this choice, as one might anticipate from Eqs. (VII-7). 
The classical state (position and velocity) is specified by assigning values 
to q and p. Both variables range continuously from — oo to +ao. Hence 
we have 

friau = e ff e ~ U(q,p)lkT dq dp, (6-3) 

where c is a constant. We have to choose c so that the classical equation 
(6-3) gives the same result as the high-temperature limit of the quantum 
equation (5-6), namely, q -* kT/hv. The fact that Planck’s constant h 
appears in this limit shows that c could not possibly be deduced from 
purely classical considerations. What we are essentially doing here, in 
evaluating c, is establishing a statistical-mechanical correspondence prin¬ 
ciple for this special case. 

We substitute Eq. (6-2) in Eq. (6-3) and carry out the two inte¬ 
grations. We find q 9 i„, = ckT/v. Therefore c = 1/h in order to satisfy 

4dau = q (T ► OO). 

The q, p space over which the integration is carried out in Eq. (6-3) is 
called the phase space of the system, and the integral itself is called the 
phase integral. Any point in phase space corresponds to a definite classical 
state. In quantum mechanics, such a precise (point) specification of state 
is not possible. According to the uncertainty principle, the state of the 
system cannot be located in phase space more closely than within an area 
A q A p of order h. This is consistent with the value just found for c, because 

h £ e ~ imnsy)lkT —* Jf e- <eoergy>/ * r dgdp (6 h1 ) 

quantum classical 

states phase space 

implies that the volume of classical phase space corresponding to one 
quantum state, in the limit of large quantum numbers, is h. The precise 
value h cannot, however, be deduced from the uncertainty principle itself, 
because this principle reads, to be exact, AgAp > h/iir (where Aq and 
Ap are standard deviations). 
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Incidentally, the relation of Eq. (6-4) to the uncertainty principle con¬ 
firms the convenience of the choice of independent variables q, p (instead 
of, say, q, 4 or some other choice). 

An alternative deduction of the factor h in Eq. (6-4), or 1/A in Eq. 
(6-3), is the following. Let us draw paths of constant energy, H(q, p ) = 
e = constant, in classical q, p space. From Eq. (6-2), we see that these 
paths are ellipses 


where 


„2 2 

— 4- — = 1 
<*2 ^ 02 1( 


(6-5) 


a 


2 


€ 

2ir 2 mv 2 ’ 


0 2 = 2 me, 


and the area of an ellipse is 

Area = iraB = - • 
v 


( 6 - 6 ) 


In particular, let ellipses be drawn, as in Fig. 6-1, for energies e = €»_i, 
c n , and e„+i, where «„ is the quantum-mechanical energy level (Eq. 5-4) 
for a large quantum number n. The area between two successive ellipses 
in Fig. 6-1 is then the area of classical phase space associated with one 
quantum state. We find from Eqs. (5-4) and (6-6), 


Area n+1 — Area„ = (n + %)h — (n + $)A = h, (6-7) 


which confirms our previous result. To pursue the matter a little further, 
suppose we assign an area A/2 on either side of the e = e„ ellipse to the 
quantum state n, as indicated by the shaded region in Fig. 6-1. For large 


I 



Fig. 6-1. Classical constant-energy paths in phase space for one-dimensional 
harmonic oscillator (schematic); n — quantum number. 
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n, the classical energy H(q, p) = e is almost constant (e = « n ) throughout 
the shaded area (more precisely, the fractional variation in e within the 
shaded area becomes smaller as n increases). We can therefore break up 
the classical integral over q, p space into discreet (as in the quantum sum 
over states) contributions from each “shaded area” (of magnitude A), 
because the integrand is essentially constant throughout each such area. 
Thus, 

e -*nltT h> 


as before. This argument becomes exact as T —> oo and higher quantum 
numbers dominate in the sum. 

As a second example, consider a particle in a cubical box of volume V. 
At sufficiently high temperatures [Eqs. (4-8) and (4-10)], 

q = £ e~*W. lkT (2»™*l) 8/: V (6-8) 

The classical analog of the above sum is 
+*> 

«oi M » = cf HI llj e - W( »V*V* > «> / * 7, dx dy dz dp x dp y dp t , (6-9) 

v 

where 

H(p z , p y , p.) = ^ (P* + Pv + P*). (6-10) 

The momentum p x = mi, etc. (Appendix VII). The potential energy is 
zero in the box and infinite outside the box. Therefore e~ a,kT is zero 
outside, and the integral need be carried out only over the inside of volume 
V. We find 

Sda 8 . = c'(2TrmkT) s,2 V. (6-11) 

Comparison with Eq. (6-8) shows that e' = 1/A 3 . 

From these two examples we might surmise that, in general, if there are 
n degrees of freedom, the phase integral has to be divided by A" to give 
the classical (high-temperature) partition function. This is consistent 
with the uncertainty principle (a factor A for each product dq dp). This 
conjecture turns out to be correct. It can be checked, as above, for each 
special case as it arises. Also, a general quantum-mechanical justification 
can be given (Chapter 22). We therefore adopt this correspondence prin¬ 
ciple os a general rule. 
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6-2 More general systems. We set down in this section a few basic 
equations in classical statistical mechanics that we shall need throughout 
the rest of the book. These pertain to more general systems than those 
considered in the preceding section. 

Suppose that the Hamiltonian for one molecule, in a system of inde¬ 
pendent molecules, is separable in the form 


H = H cU „ + (6-12) 

where H c i„„ refers to n degrees of freedom that can be treated classically 
(e.g., translation and often rotation; see Chapters 8 and 9), and # qu ant 
refers to the remaining degrees of freedom that cannot be treated classically 
(e.g., electronic states and vibration). Then, by Eqs. (3-13), (3-14), 
and Section 6-1, 

9 ~ 9class9quant> (6—13) 

where 

9«i... = ^ / e -"ci. M («.i»/*r dqi rfpi ... dqn d Pn . (6-14) 

Alternatively, of course, we can use in place of 9 c u«« the high-temperature 
limit of the quantum-mechanical q for the n degrees of freedom included 
in Eq. (6-14). But usually the direct classical calculation is easier. 

As was mentioned at the beginning of this chapter, the qualitative 
criterion for the legitimate employment of classical statistics in a partition 
function q for a single molecule (or in the q belonging to some of the degrees 
of freedom of a single molecule) is that quantum states with large quantum 
numbers contribute heavily to q (Problem 6-1). This will be the case if 
kT » Ae, where At is the magnitude of the energy separation between 
successive energy levels (Problem 6-2). This is the same criterion we have 
already used [see Eq. (4-8) and following Eq. (5-7)] for replacing the 
quantum sum for q by integral. The use of an integral in place of the 
quantum sum is therefore equivalent to a classical treatment. 

We turn now to the consideration of a system of N indistinguishable 
monatomic molecules in a volume V with translational degrees of freedom 
only and no intermolecular forces. The classical phase integral in this 
case is [see Eq. (6-9) and Appendix VII] 

I = J e - W(p)/ * r dx i dyi dzi dp xl dp yi dp t i ... 

• • • dx n dyN dzfi dpxN dp v ?f dp z y, (6-15) 

where 

H(p) = ^ (Pxi + • • • + pIn)- 


(6-16) 
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All the integrations in Eq. (6-15) are easy, and we find 

/ = l(2TrmkT) 3l2 V] N . (6-17) 

We cannot simply set Q 0 iass equal to I/h aN here because we are concerned 
with a system of N indistinguishable molecules, and not just a single 
molecule as in the previous examples in this chapter. In fact, we have 
already found in Chapter 4 that the high-temperature quantum-mechanical 
Q for this system is given by Eq. (4-12), and Q c Uas must agree with this 
result. That is, <3 c u»8 = I/h 3N Nl or 

^ cU8S = N\h 3N f e mr)lkT dxi... dp lN = jjj ^ • (6-18) 

We may understand the division by A! in this equation as follows. In the 
integral I over all classical states (q, p) of the system, the molecules are 
treated as distinguishable. For example, let N = 2, and let dr in Fig. 6-2 
represent an element of volume dx dy dz dp x dp y dp, in the phase space of 
a single molecule and dr' represent another such element of volume located 
at a different position. Then there are two separate contributions to the 
integral I counted (a) when molecule 1 is in dr and 2 is in dr', and (b) 
when 2 is in dr and 1 is in dr'. In general, with N molecules, there would 
be Nl separate contributions of this type arising from all the possible 
permutations of the molecules. Actually (i.e., from the point of view of 
quantum mechanics), the molecules are indistinguishable, and the inter¬ 
change of particles in the space of Fig. 6-2 does not lead to new states. 
Thus the classical phase integral overcounts the states by a factor N\, 
and this is corrected for by the division indicated in Eq. (6-18). It should 
be noted that in classical theory dr and dr' may be arbitrarily small so 
that we never have the complication [see Eq. (3-9)] that two molecules 
are in the same classical state. 



Fig. 6-2. Elements of volume in the phase space of a single monatomic 
molecule. 
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We have already seen in Chapter 4 that the condition (4-6) must be 
satisfied in order that the result given in Eq. (4-12) or (6-18) be correct, 
i.e., in order for classical statistics to be valid. We have also seen in the 
discussion following Eq. (4-8) that the condition (4-6) for absence of 
quantum effects is very much more stringent than the condition A E — 
0 (Ae) « kT; (4-6) is therefore not only a necessary but also a sufficient 
condition for the validity of classical statistics. By “classical statistics” 
here we mean use of (a) the factor 1/N\, and (b) the classical phase integral. 

Now suppose intermolecular forces are present in the system just dis¬ 
cussed, so that there is a potential energy U(x i,..., zjf) that depends on 
the location of the molecules in the volume V. The classical partition 
function here is 

&ia» = tfjpv fe~ H ^ p)lkT dxt... dp tN , (6-19) 

where 

H(q, v) = ^ (Pii + • • • + P?n) + U(xi ,..., 2 at). (6-20) 

The explanation following Eq. (6-18) of the use of AT! in that equation is 
valid irrespective of the existence of a potential-energy contribution to H. 
Therefore a factor l/Nl is also inserted in Eq. (6-19), though in this case 
we do not have available the high-temperature limit of the quantum Q 
as a check. This check is provided, however, in Chapter 22. As to the 
criterion for the use of classical statistics in this case, we note that the 
paragraph preceding Eq. (4-8) gives a quite general interpretation of the 
condition (4-6). Hence we may consider (4-6) to be applicable regardless 
of intermolecular forces and whether the system is condensed or not. 
The values of A 3 N/V in Table 4-1 are for liquids and become particularly 
pertinent at this point. We see, for example, that at its boiling point 
liquid neon, but not liquid argon, should show noticeable quantum effects. 
Hence Eq. (6-19) can be applied to liquid argon at its boiling point. 

The momentum integrations in Eq. (6-19) can be carried out immedi¬ 
ately, as before, and we obtain 

^ cla " = N\A 3N ’ (6-21) 

where 

Z N = j e~ U{x ' . * N )lkT dx i... dz N . (6-22) 

In Eq. 6-22, Z\ is called the classical configuration integral. In the absence 
of intermolecular forces, as in Eq. (6-18), (7=0 and Z N = V N . Equa¬ 
tions (6-21) and (6-22) are fundamental equations in the study of mona¬ 
tomic, classical, imperfect gases and liquids (Part III). 
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Finally, consider a multicomponent system of molecules which may be 
polyatomic. Suppose that the Hamiltonian for the whole system is 
separable, as in Eq. (6-15). Then 


H — tfdass "h -^quantj 

(6-23) 

Q — Qclass Qquant 

(6-24) 

“ n. f e < * ?class dpelMMt ’ 

(6-25) 


where n,- is the number of classical degrees of freedom for a molecule of 
component i. Usually (see Chapters 8 and 9) H e \ aaa includes translation 
and rotation, while // quan t includes internal vibration (and possibly an 
electronic contribution). Because of the relatively strong forces involved 
in the internal vibrations of polyatomic molecules, these vibrations are 
not perturbed much by intermolecular forces, and hence are approxi¬ 
mately separable, as in Eq. (6-25). 

6-3 Phase space and ensembles in classical statistics. Although we 
shall not use this material later in the book, for completeness we make a 
few very brief comments about the classical phase space of an isolated 
macroscopic system, of volume V, containing molecules of one or more 
species. These remarks would serve as an introduction to a discussion of 
the principles of classical statistical mechanics if we were going to give 
an independent development of this subject. However, as already men¬ 
tioned, in this book we regard classical statistics as a limiting form of 
quantum statistics. 

Let the total number of coordinates <?,- required to locate the positions 
of all molecules be n. For example, if the system contains N monatomic 
molecules, n — ZN. Thus n is an extremely large number. For each 
coordinate we define a conjugate momentum p< by Eq. (VII-6). The 
phase space of the system is therefore 2n dimensional: n coordinates and 
n momenta. The state of this classical system at any time t is completely 
specified if the position and velocity components of each molecule are 
specified, i.e., if all the g’s and p’s are assigned definite values. All this 
information is condensed into the location of a single point in the 2 n 
dimensional phase space. Such a point is called a phase point or “repre¬ 
sentative point. ” That is, the point represents the complete classical state 
of the system. 

With the forces of the system given, assignment of the position of a 
phase point in phase space at time t completely determines the future 
(and past) trajectory, or path, of the point as it moves through phase 
space in accordance with the laws of mechanics. The equations of motion 
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of the phase point are, in fact, Hamilton’s equations, (VII-7). In prin¬ 
ciple, this system of 2n first-order differential equations can be integrated 
to give • • •, ?»(0. Px(<), • • •, Pntt)- The 2n constants of integration 
would be fixed by knowing the location of the phase point at some time 
l = f 0 . Of course, in practice, such an integration is quite hopeless. 

Now suppose we make up a total of 31 isolated systems, all with the same 
V and number of molecules of each species, and all replicas—as far as 
thermodynamic properties are concerned—of the single experimental sys¬ 
tem of interest. This is an ensemble, just as in Chapter 1. The detailed 
classical state of each system can be represented by a point in the same 
phase space. The points move independently, each along its own trajectory, 
since each system is isolated. The whole ensemble then appears in phase 
space as a “cloud” of moving representative points with an essentially 
continuous density (31 —> oo). A very important and interesting theorem, 
due to Liouville, which we shall not prove or use, is that in the immediate 
neighborhood of any particular phase point, as it moves along its tra¬ 
jectory, the density of phase points remains constant. 

Just as in quantum mechanics, we find it necessary to replace the desired 
single-system time average of mechanical variables, such as pressure, by 
instantaneous ensemble averages (first postulate). But in order to calculate 
ensemble averages, we have to know the density of the cloud of representa¬ 
tive points in the various portions of phase space. Corresponding to the 
quantum postulate (Section 1-2) of equal probability for each quantum 
state of an isolated system (E, V, Ni, N 2 , ■ ■ . fixed), we have the analogous 
classical postulate (second postulate) of constant density of phase points 
throughout the region of phase space between the surfaces E = constant 
and E + SE = constant, where SE is arbitrarily small. As in Section 
1-3, we can deduce from this a density of phase points proportional to 
e -fH(g,p) [ compare Eq. (6-19) for example] for an ensemble representative 
of a closed system in contact with a heat bath, etc. 

The two postulates above, combined, lead to the classical “ergodic 
hypothesis”: the representative point of a single isolated system spends 
equal amounts of time, over a long period of time, in equal volumes of 
phase space between the surfaces E = constant and E + SE = constant, 
where SE is arbitrarily small. 

6-4 Maxwell-Boltzmann velocity distribution. As an example of an 
application of Section 6-2, let us deduce here the important formulas for 
the classical translational velocity and kinetic-energy distributions. We 
can keep the discussion quite general: we are by no means restricted to 
ideal monatomic gases only. Specifically, let us consider any one-com¬ 
ponent system for which the internal vibrations are separable, and trans¬ 
lation and rotation can be treated classically (e.g., liquid nitrogen). 
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This is a special case of Eq. (6-25): 

Q = jynSSU* / e ~ HlkT d P* d( L> d P» (6-26) 

where q x refers to 3 N translational & jrdinates, q, to n r N rotational co¬ 
ordinates (n r per molecule), and 

H = K x ( Px ) + K,(q, t p t ) + U(q x , q,). (6-27) 


The first term on the right of Eq. (6-27) is the translational kinetic energy 
(Eq. 6-16); the second term is the rotational kinetic energy (in general 
a function of both rotational coordinates and momenta, see Chapters 8 
and 9); and the third term is the intermolecular potential energy. Inter- 
molecular forces in general depend on rotational orientations of molecules 
as well as on distances between centers of mass, so U is a function of q> 
and q x . 

By analogy with Eq. (1-28) and as a consequence of the fact that classi¬ 
cal statistical mechanics is a limiting form of quantum-statistical me¬ 
chanics, we can state that 

e —HlkT dqx dj)x dpt (6-28) 

fe~ HlkT dq x dp x dq> dpe 


is the probability that the system will be observed in the classical 
translation-rotation state dq x dp x dq$ dp t , irrespective of the (independent) 
quantum-vibrational state. On integrating (6-28) over q x , q*, and p t , we 
then have that 

e~ K * lkT dp x {e~< K > +U)lkT dq x dq» dp, _ e~ K * lkT dp x 
/ e~ K * lkT dp x f e~ < - K * +U)lkT dq x dq, dp, j e~ K * lkT dp x 


is the probability that the system is in the translational momentum 
state dp x , irrespective of other conditions (q x> q t , etc.). Finally, since K x 
has independent and equivalent contributions (Eq. 6-16) from each of 
the N molecules, we can integrate (6-29) with respect to all translational 
momentum components except those belonging to any one molecule (say 
molecule 1, but we drop the subscript 1 for convenience), and find 


e -(pl+pl+P 2 ,)l2mkT ^ ^ ^ 

(2irmkT) 3 l 2 


(6-30) 


for the probability that any one molecule is in the translational momentum 
state dp x dpy dp t , or for the fraction of all molecules in this state, irre¬ 
spective of other conditions. The integral over p x , p y , and p, (— oo to +oo) 
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has been carried out in the denominator of (6-30) with the result indi¬ 
cated. It should be emphasized that (6-30) applies to any system en¬ 
compassed by Eq. (6-26). Actually, the restriction to a one-component 
system is not necessary. Each component in a multicomponent system 
has its own molecular mass and its own independent momentum dis¬ 
tribution, as in (6-30). 

If we change from cartesian momentum components p x , p v , p, to the 
corresponding spherical coordinates, p, 8, <p defined by 


then 


p 2 = pl + pl + P% Px = p sin 8 cos ip, 
Pt—p cos 8, Pv = P sin 0 sin ip, 

e -p*l2mkT p 2 gi a 0dp d8 dip 
c 2irtnkT)3l2 


(6-31) 


is the fraction of molecules in the momentum state dp dd dip. On inte¬ 
grating (6-31) over 8 and ip, we have 


4tt e- pt ' 2mkT p 2 dp 
{2KmkT)W 


(6-52) 


for the fraction of molecules with momentum between p and p + dp. 
Finally, if we put p = mo and multiply by N, (6-32) becomes 


4ir N 


f m Y' 2 

\2wkTj e 


-mv>l2kT v 2 fa 


(6-33) 


for the number of molecules with velocities between v and v + dv. This 
is the well-known Maxwell-Boltzmann velocity distribution. If we write 
« = mu 2 /2 for the translational kinetic energy of a molecule, (6-33) 
transforms into 

27rN ’ (6-34) 


(irkT)W 


e-‘/* r e 1/2 de 


for the number of molecules with values of e between e and « + de. This 
agrees (Problem 6-3) with Eq. (4-11), deduced from quantum statistics 
for a special case (ideal monatomic gas). 

If Eq. (6-30) is integrated over, say, p„ and p„ we obtain 


e -rll2mkT dpx 

(2irmkT) l l 2 



(6-35) 


for the fraction of molecules with ^-component of momentum or velocity 
in dp x or dv x . 

Applications of the above results are included in the problems. 
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Problems 

6-1. Show that in a system of independent harmonic oscillators, all with 
frequency v, the quantum number of the state with energy equal to the average 
energy per oscillator is n = kT/hv, in the classical limit (JcT/hv » 1). 
(Page 116.) 

6-2. Show that at high temperatures, q for a simple harmonic oscillator 
(Eq. 6-6) can be expanded in the form 



For what value of kT/hv is the approximation q = kT/hv accurate to 0.01%? 
Show that Ae/kT = hv/kT for a harmonic oscillator, where Ac is the energy 
difference between successive energy levels. (Page 116.) 

6-3. Verify that the Maxwell-Boltzmann energy distribution from Eq. (4-11) 
(quantum statistics) is the same as (6-33) (classical statistics). (Page 122.) 

6-4. Give an argument analogous to Eqs. (6-5) through (6-7), showing that 
the area of phase space per quantum state is h for a particle in a one-dimensional 
box. 

6-5. Prove that the surfaces in phase space H{q, p) = E (constant) and 
H(q, p ) = E\ where E v* E', can never cross. 

6-6. Show that in the classical Hamiltonian, any independent term, potential 
or kinetic, of the form a£ 2 ({ = q or p; a = constant) will lead to a contribu¬ 
tion kT/2 (per term) to the energy E and k/2 to C v . Examples are Eqs. (5-25), 
(6-2), and (6-20). 

6-7. Show that according to (6-33): (a) the most probable velocity is 
v* = (2kT/m) m ; (b) the mean velocity is 8 = ( 8kT/rm ) l/2 ; (c) the root 
mean-square velocity is (» 2 ) I/2 = ( 3kT/m) 1/ 2 ; and (d) the mean translational 
kinetic energy per molecule is ZkT/2. 

6-8. Calculate 8 in cm/sec for H 2 and O 2 at 0°C. 

6-9. Show that according to (6-35): (a) the most probable velocity com¬ 
ponent- »* is^zero; (b) 5* = 0; (c) (ri) 1/2 = ( kT/m ) ,/2 ; and (d) verify that 
» 2 = + #3 = 3t£ 
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CHAPTER 7 

INTRODUCTION TO LATTICE STATISTICS: 

ADSORPTION, BINDING, AND TITRATION PROBLEMS 

In the first two sections of this chapter we consider problems in which 
molecules of one species can be “bound” on one-, two-, or three-dimen¬ 
sional arrays of sites presented by regular arrangements of molecules or 
molecular subunits of a different species. A bound molecule executes 
three-dimensional vibrational motion in the neighborhood of its site. The 
most familiar example is the binding or adsorption of gas molecules 
on the two-dimensional lattice of sites presented by the surface of a 
crystal. But one-dimensional cases (e.g., binding of ions from solution 
on a linear polyelectrolyte molecule) and three-dimensional cases (e.g., 
absorption of hydrogen atoms by palladium) are also common. Irre¬ 
spective of the dimensionality, the system of molecules attached to a set 
of sites is sometimes referred to as a lattice gas. 

We restrict the discussion in this chapter to models in which the binding 
on any one site (or, possibly, small group of sites) is independent of the bind¬ 
ing on the remaining sites. Chapter 14 is devoted to the more complicated 
problems in lattice statistics in which this independence is lacking because 
of interactions between molecules bound on neighboring sites. 

Section 7-3 is concerned with the similar problem of binding of mole¬ 
cules on (indistinguishable) groups of sites which are free to move (e.g., 
binding of small molecules or ions on protein molecules in solution, or the 
hydrogen ion equilibrium of acid molecules in solution, etc.). Interactions 
between different groups of sites (e.g., between two protein molecules) 
are not included here but will be treated in Chapter 19. 

In Section 7-4 we discuss the elasticity of a linear polymer made up 
of units each of which can be in a "short state” or a “long state.” This 
system, we shall find, is formally equivalent to the lattice gas of Section 7-1. 
Adsorption of another molecular species on such a polymer is also studied 
in Section 7-4. 

7-1 Ideal lattice gas (Langmuir adsorption theory). An ideal lattice 
gas is a system of N molecules bound not more than one per site to a set 
of M equivalent, distinguishable, and independent sites, and without 
interactions between bound molecules. The arrangement of the M sites 
in space is immaterial. In the Langmuir adsorption model, the sites are 
arranged in a regular two-dimensional array on the surface of a crystal, 
and the bound molecules come from a gas phase which is in equilibrium 
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Fig. 7-1. Adsorbed molecule A near surface of crystal B. 


with the lattice gas (adsorbed phase). For concreteness and because of 
its importance, we shall discuss this particular (adsorption) problem 
explicitly here, but the treatment is not restricted to it, as will be evident 
from Section 7-4 and Chapter 14. 

As an example, suppose the monatomic gas A is adsorbed on the surface 
100-plane of a simple cubic lattice of solid B. We assume for simplicity 
that the forces holding the solid together are much stronger than the 
adsorption forces, so that the solid is essentially unperturbed by the 
presence of gas molecules on its surface. Thus the solid merely plays the 
role of providing a potential field for the adsorbed molecules. The thermo¬ 
dynamic system we consider consists, then, of gas molecules “bound” in 
this potential field. 

Suppose that the potential energy of interaction w(r) between a 
molecule of A and a molecule of B is of the Lennard-Jones type (Appen¬ 
dix IV). The adsorbing force holding an A molecule to the surface of B 
is then the sum of a number of such interactions. In Fig. 7-1 .s shown an 
adsorbed molecule A at a distance z from the surface of B (taken as the 
zy-plane), and a few of the distances r,- which should be substituted in u(r) 
to give the total potential energy of interaction of A with all molecules 
of the solid, for this particular location of A : 


f/= (7-1) 

t 

In Eq. (7-1), U is a function of x, y, and z, the coordinates of A. If we hold x 
and y fixed and consider the dependence of U on z (i.e., along a line perpen¬ 
dicular to the surface), U(z) will have a qualitative appearance similar to 
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Fig. 7-2. Periodic variation over surface of depth f7o of the potential well 
U(z) for an adsorbed molecule. 


«(r) in Fig. IV-1 and the approximate mathematical form (Problem 7-1) 


rr/ . const const' 
U{z) = — - -£-• 


(7-2) 


Thus the free translational motion of a gas molecule in the 2 -direction is 
replaced on adsorption by vibration in the potential well associated with 
Eq. (7-2). In the harmonic-oscillator approximation, the frequency of 
vibration v t is determined by the curvature of U(z) at its minimum. 
Usually v„ is of order 10 12 sec -1 (Problem 7-2). 

For different choices of the fixed values of x and y, U(z) will be different 
(in other words, U is a function of x and y also). Thus U(z) directly above 
a B molecule will be different from U(z) above the center of a square of B 
molecules. In particular, the depth U 0 of the potential well in U{z) will 
vary periodically in both x- and y-directions. Hence Uo is a function of 
x and y. Figure 7-2 illustrates this variation, where Uo is plotted against 
x along one particular line (y constant) in the surface, as indicated in the 
figure. It is clear from Fig. 7-2 that motion parallel to the surface involves 
passing over potential barriers F 0 - With van der Waals forces, these 
barriers are of order 0.3 to 1 kcal-mole -1 . At temperatures sufficiently 
low that the thermal energy kT of the adsorbed molecules is small com¬ 
pared with the height of the barrier V 0 , the molecules will be trapped 
(“localized”) in the neighborhood of potential minima in U(x, y, z), except 
for occasional passages over the barrier or evaporation and recondensation 
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(necessary for equilibrium with the gas phase). When kT is large compared 
with Vo, the periodic variation in Uo becomes unimportant, and the 
adsorbent surface, in effect, becomes a continuum. 

The Langmuir model corresponds to localized adsorption (kT <K V 0 ). 
At the end of this section we shall also discuss briefly the case of a dilute 
“mobile” (kT y> Vo) adsorbed phase. The transition from localized to 
mobile adsorption, as T is increased, is an interesting problem which will 
be dealt with in Chapter 9. 

In localized adsorption, the adsorbed molecule has three vibrational 
degrees of freedom (replacing three translational degrees in the gas). 
We have already mentioned the vibration in the z-direction. In addition, 
there is vibration in x- and y-directions (if these are the normal coordinates 
for the two-dimensional motion) around the minima of U 0 (x, y), with 
frequencies v x and v y . These frequencies are usually a little less than 
10 12 sec -1 . 

The partition function for a single adsorbed molecule, in the harmonic- 
oscillator approximation, is then 

q(T) = q x q v q*e- Uo ° lkT , (7-3) 

where q x , q„, and q z are one-dimensional harmonic-oscillator partition 
functions (Eq. 5-6) with frequencies v X} v y , and v z , respectively, and with 
zero of energy in each case at the bottom of the potential well U(x, y, z). 
As explained above, the crystal surface is assumed to present an external 
and fixed potential field, so that q is a function of T only. Since we want 
to investigate the equilibrium between adsorbed and gas molecules, we 
must choose the same zero of energy for the two phases. We take an 
isolated gas molecule [i.e., z = oo in Eq. (7-2)] at rest as the zero. With 
this zero, we have to insert the Boltzmann factor in U 00 (a negative quan¬ 
tity) in Eq. (7-3), where Uoo is the potential energy at the minima in 
U(x, y, z) [or in Uq(x, y)]. Then the heat of adsorption per molecule at 
0°K is Uoo + h(v x + v y + v,)/ 2. 

If we had a system of N sites and N molecules, we would write Q = q N , 
just as in the Einstein model for a crystal. But here the number of sites 
(equivalent but distinguishable) is M > N. There is therefore a con¬ 
figurational degeneracy not present in an Einstein crystal (it would be 
present if the crystal had lattice vacancies), which must be taken into 
account. For each quantum state of an Einstein crystal there are here 
M\/N\(M-N )! quantum states—this being the number of ways N indis¬ 
tinguishable molecules can be distributed among M labeled sites. Hence 

« (w '"- r) = VW=T01' <»-*> 
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and 

InQ = M In M - NlnN - (M - N) In (M - N) + N\nq. (7-5) 

The basic thennodynamic equation for the lattice gas (or adsorbed 
phase) is 

dE — TdS — $ dM + pdN, (7-6) 


where M is proportional to the volume, area, or length, depending on the 
dimensionality of the lattice. That is, M is the “external variable” in this 
case, and $ is essentially a pressure, though it has dimensions of energy 
here. Thus, in two dimensions, if we write Ct = Ma where Ct = area and 
a is a constant, then $/a is the so-called surface or spreading pressure 
(dimensions of energy per unit area) ip. 

From Eqs. (7-5) and (7-6) we have 


b - (w)„ - - •> - • + »•* + *«* + ••• • 

JL _/ainQ\ _ . _ e_ 

kT~ \dN )m, T (l - 0)q ’ 


(7-7) 

(7-8) 


where 6 — N/M, the fraction of sites occupied. Equation (7-7) is the 
equation of state. Note that in the limit as 0 —> 0, $ —> 8kT (which is the 
same, in three dimensions, as p — » ,)kT, where p — N/V). If the adsorbed 
phase is in equilibrium with a gas phase at pressure p [assumed ideal as in 
Eq. (4-25)], 


JL 

kT 


ln 


0 


(1 - 6)q 


Mga» 

kT 


m°(D 

kT 


+ In P, 


(7-9) 


or 

6{V ’ T) = l + x(T)p ’ X(T) = <?(7 , )«‘‘° <r,/tr (7-10) 


Equation (7-10) is the Langmuir “adsorption isotherm,” giving the amount 
of gas adsorbed as a function of gas pressure at a fixed temperature. When 
p —* 0, 8 —> Xp, and when p — > oo, 0 —> 1. Many adsorption systems 
follow this equation approximately. Statistical mechanics provides all 
the necessary molecular details to calculate x(T) explicitly (Problem 7-3). 

The entropy of the lattice gas (Eq. 1-33) can be put in the form 
(Problem 7-4) 

S = Sconfig 4" $vib> (7—11) 


where 


Sconfig — — N)\’ 


(7-12) 
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/Svib = Nk 


In 9 + 


m rilngN 
dT )‘ 


(7-13) 


We note that S coa n e is independent of temperature, whereas S vl b —* 0 as 
T —■► 0 (just as for an Einstein crystal). If equilibrium could be main¬ 
tained to T — 0, we would have S > 0 at T = 0. This is not an exception 
to the third law of thermodynamics (Section 2-4), since the combined 
system sites + lattice gas is not a pure crystal, but a binary mixture. 
That is, a lattice gas can exist only in the presence of an additional molecu¬ 
lar matrix (e.g., a crystal) that provides the sites. 

For future reference, let us also apply the grand partition function to 
this system. We have 


A 


♦Af/tr _ v M!(gX) Af 
fco N\{M-N)\ 


or 


= (1 + q\) M , 


(7-14) 


= In (1 + ffX), 


(7-15) 


where X = e* lkr . This agrees with the result obtained by eliminating 6 
between Eqs. (7-7) and (7-8). Also, from Eq. (1-67), 


which is essentially just Eq. (7-10). 

For contrast we consider a dilute, mobile adsorbed phase (two-dimen¬ 
sional ideal gas). In this case the adsorbed molecules still vibrate in the 
z-direction with partition function q„ but have free translational motion 
in the zy-plane. The xy partition function for one molecule is [see Eq. (6-9)] 


2irmkT „ 
9*v — ^2 


where a = area. Then 


Q = 



9 = 


9*t/9*« 


-UooIkT 


(7-17) 


(7-18) 


For present purposes, we replace $ dM in Eq. (7-6) by <f dct. Then 
we find 
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JL 

kT 



_ N _ Mga. _ H°(T) 
~ u q - kT~ kT 


In p. 


(7-20) 


The adsorption isotherm is then 

~ = x'(T)p, X'(T) = q t e- UwlkT / !kT . (7-21) 


It is interesting to compare the predicted relative amounts of localized 
and mobile adsorption in the limit as p —> 0 where Eq. (7-21) is valid and 
6 —* xp in Eq. (7-10), assuming all conditions the same except for the 
^/-motion. We find from Eqs. (7-10) and (7-21), 

•^mobile (2irmkT/h )a /m 

-= -, (7-22) 

localised QxQy 

where a = Q./M = area per site. The numerator on the right is the 
partition function for a molecule moving freely in a two-dimensional 
box of area a. The denominator is the partition function of a two-dimen¬ 
sional oscillator, which would have a numerical value less than the 
numerator because, in classical language, there is a Boltzmann factor 
exp [—(/*x 2 + f y y 2 )/2kT] (Eq. 6-1) in the integrand of the configuration 
integral for an oscillator, whereas the integrand is unity [Eq. (6-22), 
N = 1] for a particle in a two-dimensional box. Therefore, we find 

^mobile > ^localised- 


7-2 Grand partition function for a single independent site or subsystem. 
In Section 3-4 we discussed a canonical ensemble of subsystems. Prob¬ 
lem 3-5 extends the argument to a grand ensemble of subsystems. Because 
of its importance, we discuss here the same question as in Problem 3-5, 
but use a different approach: we start with the ordinary grand partition 
function and show that it reduces to a product of subsystem grand parti¬ 
tion functions {. 

Consider a macroscopic system of M equivalent, independent, and dis¬ 
tinguishable sites on each of which any number s, from zero to a maximum 
m, of molecules can be “bound” (the nature of the binding or association 
with the site is immaterial here). Let q(s) = e~ t > (,)lkT be the site 
partition function when s molecules are bound to the site. If there is a 
total of N molecules bound on the M sites, and if the number of sites having 
s molecules bound is o„ then the canonical ensemble partition function for 
the system of M sites is 

Q(N,M,T) = Z~ M0) JT 


(7-23) 
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where the sum is over all sets* a = a 0 , «i, ..., a« satisfying the re¬ 
strictions 

E a. = M, £ “• = N - (7-24) 

«—o *—o 


This is a straightforward extension of Eq. (7-4) which, in the present 
notation, would be written 


where 


* 


M! g (0) O0 g(l) ai 

o 0 !oi! 


®o 4" ®i = M , oj = N, 


5(0) = 1, s(l) = ?• 


That is, in the Langmuir problem, s = 0 (site empty) or 1 (site occupied). 
The grand partition function is 

mU 

s= E Q(N,M,T)\ N 

N-0 


^ M!5(0) a °[5(l)X] ai [5(2)X 2 p» • ■ • [ 5 (m^r 

V a 0 ! • • • o»! 


(7-25) 


where we have used the second of Eqs. (7-24) in rewriting in the last 
step, and the only restriction on the sets a is now the first of Eqs. (7-24), 
since we want to sum over all possible values of N for given M. By the 
multinomial theorem [this is just a generalization of the binomial theorem 
used in obtaining Eq. (7-14)], 

2(X, M, D = f(X, T) M , (7-26) 

where 

i = 5(0) + 5(1)X + • • • + q(m)\ m 

= E 900 x '- (7-27) 

•-0 


This is a good example of a summation having been made easier by passing 


* Here and throughout the rest of the book we use a boldface letter to repre¬ 
sent a set of numbers with subscripts 1, 2. For example, we use N for 

Ni, N 2 ,... (in analogy with the components of a vector). We did not introduce 
this notation in Chapter 1 for N 1 , N 2 ,... and 111 , 112 , • • • because there we 
wanted to emphasize the one-component case for simplicity. Hence we used 
N instead of N, etc. However, N is generally preferable because it is more explicit. 



132 


INTRODUCTION TO LATTICE STATISTICS 


[CHAP. 7 


from Q to 3. The sum £ has exactly the form of a grand partition function, 
but it pertains to only a single site instead of a macroscopic system. It is 
rather obvious that if the sites were all different, but still independent of 
each other, 3 = iifa... £m, where & is a sum, as in Eq. (7-27), for site 
i only. The sites must be independent of each other for this simplification 
in 3 to occur. Equations (3-5) and (3-6) are analogs in the canonical 
ensemble. 

If, when all sites are equivalent, we consider each site to be an open 
subsystem, then the macroscopic system of M sites may be regarded as a 
grand ensemble of subsystems (see Problem 3-5). The chemical potential 
and temperature of a single open subsystem are well-defined thermo¬ 
dynamic properties, for they are determined by the chemical potential and 
temperature of the macroscopic reservoir with which the subsystem is in 
contact. Incidentally, in adsorption problems, this reservoir is real and 
not imaginary: it is the gas or solution, in equilibrium with the adsorbed 
phase, providing the molecules or ions being adsorbed. 

The average number of molecules in the macroscopic system is 

or 

F _ /alnj\ _ L7-0sg(s)X* 

m ~ K \ax )t~ Er-o«W x * ’ 

where 3 is the average number of molecules per site. Thus the equation 
relating 3 and £ (subsystem) is formally the same as that relating N and 
3 (macroscopic system). In the differentiation in Eq. (7-29), m is held 
constant, which is analogous to holding M constant in Eq. (7-28). 

The “pressure” $ is related to the site or subsystem grand partition 
function £ by 

e *M!kT = g = 

$ = AT In {(X, T). (7-30) 

Thus, whenever we are concerned with a system of independent, dis¬ 
tinguishable, and open sites (subsystems), we can go directly to Eq. (7-29) 
to determine the mean population 3 and also the population distribution 
[since g(s)X*/{ is clearly the probability of a population a] of each site. 
This is a simpler procedure than using the full grand partition function, 
but of course leads to the same results. The mean population 3 as a func¬ 
tion of X is essentially the adsorption isotherm in an adsorption system. 

The situation with respect to fluctuations in a in a subsystem is com¬ 
pletely analogous to that of fluctuations in the energy of independent 


(7-28) 

(7-29) 
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molecules in Eq. (3-24). We find from Eqs. (7-29) and (2-7), 

F-W-w(g) r -$(g)„ 


or 


N 2 - (N) 2 
~ M 

5 TJ 


just as in Problem 3-8. Since <r N /'N is of order M~ 112 , <r, is of the same 
order of magnitude as 5. That is, the probability distribution in s is broad, 
as should be expected, and not sharp. 

We now consider some applications of Eq. (7-29). First, an ideal lattice 
gas. Here, as was pointed out following Eq. (7-24), a = 0 or 1, g(0) = 1, 
and g(l) = q. Therefore, 

$ = 1 + 2 X, 




q\ 


+ «X 

which is the same as Eqs. (7-10) and (7-16). 

Next, suppose we have a system of M independent pairs of sites: 


(7-31) 


•1 

•1 

•1 

x2 

X2 

x2 


Such a system of sites might occur on a linear polymer, for example. The 
two sites in a pair are different: qi is the partition function [as in Eq. (7-3)] 
for a molecule bound on a site of type 1, and 92 for a site of type 2. Also, 
when both sites of a pair are occupied, let there be a potential energy of 
interaction w between the two bound molecules. So long as there is no 
interaction between molecules on different pairs, the introduction of w 
does not affect the independence of the pairs. Thus, a pair of sites is the 
independent subsystem here, and a = 0,1, or 2. If we consider all possible 
states of a pair of sites with a molecules bound, we see that 


g(0) = 1, 9(1) = 91 + 22, 9(2) = 2i22e 


,-wlkT 


Therefore 

and 


{ = 1 + (2x + 92)X + 2 i22« ""V, 

(9x + 92)X + 29192C 




s = * = _ 

M 1 -I* (91 ■]• 92)X 2i22e- w /* r X 2 


(7-32) 

(7-33) 
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This is the adsorption isotherm. Except for a factor of two, this reduces to 
Eq. (7-31) for an ideal lattice gas if w = 0 and 91 = 32 = 9- The factor 
of two has its origin in the fact that the total number of sites is 2ilf in 
Eq. (7-33) and M in Eq. (7-31). 

As a third example, we derive the well-known B.E.T. (Brunauer- 
Emmett-Teller) equation for multimolecular adsorption. The model, 
which is physically unrealistic, is the following. A surface has M inde¬ 
pendent, distinguishable, and equivalent sites, on each of which an in¬ 
definite n umb er of molecules can be adsorbed in a vertical pile, so that 
« = 0,1, 2,.... Let 91 be the partition function for the bottom molecule 
in a pile (“first layer”), 92 for the next molecule (second layer), etc. The 
positions in a pile are distinguishable in the same sense as in a one-dimen¬ 
sional Einstein crystal. Then 

{ = 1 + 9i^ "I" 9i9sX 2 + 9 i<?293^ 3 + • • • , (7-34) 

and 

$ =— — 9i^ + 29192A 2 + 3919293X 3 + • • • (7-35) 

M { 


This is the adsorption isotherm. Now consider the special case 92 = 93 = 
94 = .... That is, molecules in the first layer (next to the surface) have 
a partition function q u and all others 92 . (Second and higher layers are 
supposed to be “liquid-like,” though a one-dimensional pile is a very poor 
model for a liquid.) With this simplification, 


_ 9i*(l + 2 ga X + 39IX 2 + —) 

1 + 9iA(1 + 92* + 9§* 2 + " *) 

_giX_ 

(1 — 92* + 9i*)(l — 82*) 

_ex_ 

~ (1 — X + cx)(l — x) ’ 

where 

c = 9 i/92> 


(7-36) 


X — 92* — 92**m» 

= q y >kT p. (7-37) 


Equation (7-36) is the B.E.T. adsorption isotherm. 

A typical isotherm (c = 157) is shown in Fig. 7-3. Many experimental 
isotherms with this type of qualitative behavior are known. When c » 1, 
adsorption in the first layer is strongly favored relative to higher layers, so 
the first layer is almost completely filled before higher layers begin. This 
accounts for the “knee” in the isotherm (Fig. 7-3) near 5=1. Up to 
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Fig. 7-3. B.E.T. adsorption isotherm for c = 157. 


almost 8 = 1, 8 = cx/( 1 + ex) (Langmuir equation) when c » 1. On 
thermodynamic grounds we know that 8 -» oo when p —* p 0 (the vapor 
pressure of liquid adsorbate at the temperature of the experiment; we 
assume T < T c , the critical temperature). That is, when p approaches p 0 , 
bulk liquid will begin to condense on the adsorbing surface. On the other 
hand, 8 —♦ oo when x — * 1, according to Eq. (7-36). Therefore, from 
Eq. (7-37), we have 

q a ef filkT = —. (7-38) 

Po 

which is equivalent to the vapor pressure equation for an Einstein crystal 
(Problem 5-11). 

In the B.E.T. theory, the surface pressure $/a is determined by 

e* MlkT = S = 
or 

± - In ; - In + X }1> ) 

= ■» (’ T-t” )- < 7 ' 39 > 

We note that # —> oo as x —» 1. But, thermodynamically, $/a is neces¬ 
sarily finite at x = 1. This is a serious fault of the B.E.T. theory. 

The general method of this section can easily be extended to cases in 
which more than one species of molecule can be bound to a site (i.e., the 
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subsystem is multicomponent). For example, let us return to the system 
above of M independent pairs of sites, but suppose now the two sites in a 
pair are equivalent. Two species A and B can be bound on the sites (not 
more than one molecule per site) with partition functions q A and qb, and 
if both sites in a pair are occupied, there is an intermolecular potential 
energy w AA , v>ab, or wbb, depending on the species making up the pair. 
The general equations for two species A and £ are (Problem 7-5) 


S(\ A , Xb, M, T) = £(X A , Xb, T) m , 

(7-40) 

i = 2 q( 8a > s b)^'a Xjf, 

*A“° *B”° 

(7-41) 

, _ _ x /dln{\ 

A M A \ dX* )\ b ,t ’ 

(7-42) 

and a similar equation for 3b. Equation (7-41) is a two-component grand 
partition function for a subsystem. In the special case under consideration, 

i = g(0,0) + 3(1, 0 )Xa + 3(0, 1)Xb + 3(2, 0)\ a A 


+ 8(1, 1)X a Xb + 3(0, 2)X|, 

(7-43) 


where 


3 ( 0 , 0) = 1, s(l, 0) = 2q A , q(0,1) = 23 s, 

9(2, 0) = q A e~ VAA/ir , 

3(1,1) = 2q A q B e- WABlkT , (7-44) 

3 ( 0 , 2) = q%e- w ** lkT . 

Then 

7? A 3(1,0)X^ + 23 ( 2 ,0)\3 + 3(1, DXaXb 

_ _ _-—— -—-— , (7-45) 

with a similar equation for Sb. This is equivalent to Problem 7-6 if w AA = 
w A b = wbb — 0 . 

7-3 Systems composed of independent and indistinguishable sub¬ 
systems. Here we extend the treatment of the previous section to systems 
in which binding or adsorption takes place on sites which are attached to 
freely moving adsorbent molecules. Examples are: adsorption of ions from 
solution onto protein or other macromolecules; titration curves (i.e., 
binding of H + ) of polybasic acids or polyacidic bases; adsorption of a gas 
on small dust particles; etc. The subsystem here consists of one adsorbent 
molecule (protein, etc.) and s bound (H + , etc.) molecules, where 0 < 
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s < m as before. We still assume, for simplicity, that the adsorbent mole¬ 
cules are unperturbed in their other properties by the presence of adsorbed 
molecules. We shall use a language appropriate to a gaseous system, since 
we have not yet discussed solutions. But we shall see in Chapter 19 that, 
under suitable conditions, if the adsorbent molecules are in a solvent rather 
than a vacuum, we merely have to replace the adsorbent partial pressure p 
by the osmotic pressure n. The gas is assumed ideal—that is, sufficiently 
dilute so that intermolecular interactions are unimportant. However, 
interactions between adsorbed molecules on the same adsorbent molecule 
are included. Interactions between adsorbent molecules will be discussed 
in Chapter 19. 

The system we consider is shown schematically in Fig. 7-4. There are 
M adsorbent molecules in a volume V, to each of which is attached from 
zero to m adsorbed molecules, with a total of N adsorbed molecules. Free 
adsorbate molecules are not included in the system for convenience and 
also because this omission leads to equations which are completely anal¬ 
ogous to the osmotic equations of Chapter 19. Since the gas is assumed 
ideal (each component behaves independently), no error is introduced by 
this procedure, and the pressure p of the system as we have defined it is 
actually the partial pressure of the adsorbent molecules in the adsorbent- 
adsorbate gas mixture. 

This is a two-component system. The basic thermodynamic equations 
are 

dE = TdS - pdV + pdN + pldM, 

(7-46) 


A = E — TS = -pV + fiN + n'M, 


where pi is the adsorbent chemical potential. 



Fig. 7-4. Adsorbent molecules (large) with adsorbed molecules (small) 
attached. 
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We let q(s) be the partition function of an adsorbent molecule with s 
adsorbed molecules attached. This will include, as factors, the translational 
[hence each q(s) is proportional to F], rotational, etc., partition functions 
of an adsorbent (s = 0) molecule. Also, for the adsorbed molecules, q(s) 
will include a factor </* [as in Eq. (7-3)], possibly a configurational factor 
or factors (for distributing s molecules on m sites), and Boltzmann factors 
in the potential energy of interaction between the s adsorbed molecules, if 
such interactions are present. The quotient q(s)/q( 0), which we denote 
by 9 o(*)> refers to the adsorbed molecules only and corresponds to §(s) in 
the preceding section [in the preceding section, q( 0 ) = 1]. 

We now treat this system as an ideal gas mixture of m + 1 different 
molecular species, corresponding to the different numbers of adsorbed 
molecules, s = 0, 1 , ,m. If there are a, molecules of species s in the 

system, then the canonical ensemble partition function is [according to 
Eq. (3-15)] 

q(0) a o q( 1)«» q{m) a - 
a<>! «i! fl»»! 

However, as a partition function we want a sum over all possible states of 
the system, not for fixed a 0 ,.. ., a m , but rather for all possible a 0 ,..., a m 
consistent with fixed N and M. Therefore 


Q(N, M, V, T) 


XT' g( 0 ) a ° • • • q(m) am 

V «<>!••• a»! 


(7-47) 


subject to the restrictions (7-24). Equation (7-47) also follows from 
Eq. (7-23) on dividing by M\, as should be expected. 

We wish next to multiply Q by A = e l ‘ lkT and sum, as in Eq. (7-25). But 
first we have to digress briefly to consider what kind of a partition function 
this will give us. Summing over N with M fixed corresponds to a system 
open with respect to adsorbed molecules but closed with respect to ad¬ 
sorbent molecules (see the last paragraph of Chapter 1). From the rule 
mentioned in connection with Eq. (1-92), the partition function, which we 
denote by T, is related to thermodynamics by 


where 


r(M, F, M, T) = e 


(pV-n'M-HkT 


q(0) M Z M 
ilf! ’ 


£ Q(N, M, F, T)\ n (7-48) 

N-0 

(7-49) 


{ = 1 + <?o(l)X + • • • + qo(m)\ m . (7-50) 


In Eq. (7-49), q( 0) is a function of the thermodynamic variables F and T, 
and {is a function of n and T. 
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The average number of adsorbed molecules is 


7? = 


Zn NQ\ n = ( d In r\ 
r \ ax 



(7-51) 


Thus Eq. (7-29), either as it stands or with q(s) replaced by qo(s), is also 
valid here: the number of adsorbed molecules per subsystem depends on 
the nature of the group of m sites, but not on whether they are moving 
around or not—just what one would expect intuitively. 

It is instructive to consider the grand partition function as well (system 
open with respect to both components): 


e pVlkT = S(V, ft, y!,T) = £ Q(N, M, V, T)\ N \' M 

N.M 


= E 


q(0) M t M X* = e4(0)tx . 


Af-0 


M! 


Now 


This is the expected equation of state. Also, 

-Mil 


(7-52) 

(7-53) 


(7-54) 


which is equivalent to Eq. (7-51). 

The simplest application of Eq. (7-51) is to a system of adsorbent 
molecules each of which has only one site for binding. The hydrogen ion 
equilibrium of a dilute monobasic acid would be an example. Then Eqs. 
(7-51) and (7-31) are applicable, and we get just the Langmuir adsorption 
isotherm. This is also the familiar equation of a simple titration curve 
(Problem 7-7) if we plot 0 as a function of In X rather than X (X is propor¬ 
tional to the concentration of adsorbate in a dilute solution). 

If each adsorbent molecule has two sites and there is an interaction 
energy w between two adsorbed molecules on the same adsorbent, we ob¬ 
tain Eq. (7-33) again. An example would be the binding of hydrogen ions 
by, say, a diamine with two different sites (—R—NH 2 and —R'—NH 2 ). 
In this case w is positive (repulsion between two hydrogen ions). The 
relation between the statistical mechanical equation (7-33) and the equiv¬ 
alent thermodynamic expression involving successive dissociation con¬ 
stants is explored in Problem 7-8. 
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If each adsorbent molecule has m sites and there are no interactions, 
then we again get the Langmuir equation (Problem 7-9). 

7-4 Elasticity of and adsorption on a linear polymer chain. The first 
problem we examine here is superficially unrelated to the preceding sec¬ 
tions of the chapter. Actually, though, there is an extremely close analogy, 
as will be seen below. We consider a single linear polymer chain composed 
of units each of which can be in a short state a of length l„ or a long state 0 
of length lg (lg > la). The units are interconvertible (as in the chemical 
reactions of Chapter 10). Thus if a pulling force r is applied to the chain, 
some a units will be converted into longer 0 units and the chain will 
lengthen. This model is crudely representative of some real systems: the 
a — 0 transition in fibrous proteins; the helix-random coil transition in 
solutions of proteins and nucleic acids (r = 0); and possibly the elasticity 
of muscle and some textiles. 

Neighboring units in the chain are assumed here to be independent of 
each other. Interactions between units will be considered in Chapter 14. 

Let M be the total number of units, with M a and Mg of the two types. 
We choose M a to be independent; then Mg — M — M a . The funda¬ 
mental thermodynamic equation is 

dE = TdS + rdl + ii'dM, (7-55) 

where l is the length, 

l = laMa + lg(M - Ma), (7-56) 

and ju' refers to a single unit (either a or 0). If we replace l and M as 
independent variables in Eq. (7-55) by M a and M, we have from 
Eq. (7-56), 

dE = TdS- T(lg - la) dMa + (m' + Tig) dM. (7-57) 

We let ja(T) and jg(T) represent the partition functions of one a and 
one 0 unit respectively. The explicit forms used for j m and jg would depend 
on the particular system of interest. The ratio of j a to jg reflects the rela¬ 
tive intrinsic (i.e., unbiased by a force; r = 0) stability of the two kinds 
of units. 

The canonical ensemble partition function is 

where the configurational factor is the number of ways of distributing 
M a a units among a total of M possible positions in the chain. We then 
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Fig. 7-5. Length-force relation for linear polymer chain of a and 0 units. 


find from Eqs. (7-57) and (7-58), 


~ la) 

kT 


J_ (dA\ = / d In Q\ 
kT\dM a /if,T \dM a /M,T 



(7-59) 


where 8 = MJM. Equation (7-59) is essentially the length-force relation 
(Fig. 7-5), since l and 5 are related, from Eq. (7-56), by 


At zero force, r = 0, 


1 - a = 


l - Ml. 
M(l» - l.)' 


5 _ M a jet 

1 — 8 Mfi jfi 


(7-60) 

(7-61) 


This is the intrinsic stability ratio (or “equilibrium constant”; see Chap¬ 
ter 10) referred to above. When r —» oo, 8 —> 0, and l —* Ml 0 (all “long” 
units). 

The similarity between Eqs. (7-8) and (7-59) will be obvious. The 
chemical potential of a lattice gas and the force on a polymer chain play 
similar roles in altering the equilibrium ratio of the two possible states in 
each case (empty and occupied sites; a and 0 units). 

Let us consider also the partition function (Problem 7-10) 

Y= Q(M a ,M,T) v M °, (7-62) 

Afa-0 


y = e 


where 


—nl0—l a )lkT 


(7-63) 
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In Eq. (7-62), Y is the analog of 3 for a lattice gas. Using Eq. 
we obtain 

Y = £*, 

where 


£ = jp + jaV- 


(7-58), 

(7-64) 

(7-65) 


This corresponds to 1 + q\ for a lattice gas. Also, 
fa M a Qi) Ma 


Ida = 




or 

l _ Mi _ / d in A _ j*v 
M jp + jaV 


(7-66) 


which agrees with Eq. (7-59). 

We turn next to a more complicated situation. Suppose that there are 
x equivalent and independent sites on each unit of the polymer chain for 
adsorption of another molecular species. An example would be the binding 
of hydrogen ions. In this case the length-force curve of the polymer would 
depend on the hydrogen ion concentration. Let q„ (qp) be the partition 
function of one adsorbed molecule on a site of an « (ft) unit. In general, 
q a 5^ qp, so now we have the possibility of, say, altering the length of the 
polymer chain at constant force as a consequence of a change in the concen¬ 
tration (or pressure, if a gas) of the adsorbate. For example, if q„ > qp, 
an increase in adsorbate concentration will cause some 0 units to go over 
into a units, thus shortening the chain. It is conceivable that an effect of 
this kind could be involved in muscle contraction. This problem is very 
simple to treat by generalizing £ above (Eq. 7-65), but it is rather cumber¬ 
some using Q (Problem 7-11). 



Fio. 7-6. Shortening of chain with increasing adsorbate concentration at 
constant t in the arbitrary special case x = l, lp/l a = 5, jjjp = 1/10 (ji units 
longer and more stable than a), and qp = 0 (no adsorption on /S units). Curve a: 
r = 0 (rest length as function of X). Curve 6: exp [r(lp — l a )/kT] = 3.5. 
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The thermodynamic equation here is 

dE = TdS — t(1„ - 4) dM„ + (»' + rig) dM + M dN, (7-07) 

where n and N refer to adsorbed molecules. We set up a function £(ij, X, T) 
which is analogous to the two-component £ in Eq. (7-41). We need a term 
in £ for each possible state of a single unit. Thus, the unit can be of type 
P with 0, 1 , ..., x molecules adsorbed (qg), or of type a with 0, 1 ,..., x 
molecules adsorbed (q a ). The appropriate combination of Eq. (7-65) and 
Problem 7-9 is 


Then 


where 


£ == Ml + ?^)* + Jav(l + <?«x)*. 

( 7 - 68 ) 

, (d ln A JaV(l + Qa\) X 

V )\,T £ 

( 7 - 69 ) 

5 = x (^r), r = x W“ s + w ~ 4 )]> 

( 7 - 70 ) 

/. 9 «x „ qg\ 

9 " ~ 1 + q a \ ‘ 1 + qg\ 

( 7 - 71 ) 


Equations (7-71) give the fraction of sites of the two types which are 
occupied, as in Eq. (7-16). In the limit as X —» oo, 8 —> x (all sites occupied) 
and 

idg + umi 


If we solve Eq. ( 7 - 69 ) for ij, we find for the length-force relation 


—In ij 


T(lg - U 

kT 



iSL 

Jfi 


(l + g«xV l 

\1 + 5*V J 


( 7 - 72 ) 


This shows the effect of adsorbate concentration (proportional to X). If 
q„ > qg, the force r must be greater to achieve the same length (or S) 
when X > 0 as compared with X = 0. This is because the presence of 
adsorbate molecules favors short units in this case. Figure 7-6 illustrates 
an equivalent effect: the shortening of the chain with increase in X at 
constant r when q a > qg. 
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Problems 


7-1. Assume that the solid is a continuum with number density p — N/V 
and occupies the semi-infinite region 2 < 0. Let ti(r) in Eq. (IV-1) be the 
interaction potential between a gas molecule and one molecule of the solid. 
Show that the total interaction between a gas molecule at 2 > 0 and the entire 
solid is 


m = 


er* 12 rp 
452» 


tr* 6 xp 

323 


(7-73) 


(Page 126.) 

7-2. Show that the frequency of simple harmonic motion at the bottom of 
the potential well (7-73) is 



(Page 126.) 

7-3. Consider argon gas adsorbed on a solid at 200°K, according to Lang¬ 
muir’s equation, (7-10). In Eq. (7-3), take Uoo as 1500 cal-mole -1 and v* = 
Vy — v, = 5 X 10 12 sec -1 . Calculate 8 when p — 1 atm. (Page 128.) 

7-4. Verify Eq. (7-11) for the entropy of an ideal lattice gas. Make a rough 
plot of S/Mk as a function of 6 over the entire range 0 < 8 < 1. (Page 128.) 

7-5. Extend the derivation of Eq. (7-26) to include Eq. (7-40) for adsorp¬ 
tion from a binary mixture. (Page 136.) 

7-6. Apply the {-method to a two-component ( A , B) ideal lattice gas (inde¬ 
pendent sites; each site can be empty or can be occupied by an A or a B). 
Show that 


^A _ _ gAx _ 

Af 1 + 3 aXa + Qb^b 

(Page 136.) 

7-7. Put Eq. (7-31) in the form 




where A 1/2 is the value of X when 8 «= J. Make a rough plot of 8 against 
In (AA 1 / 2 ) (“titration curve”). Compare with Fig. 7-5. (Page 139.) 

7-8. Consider a system of adsorbent molecules, each with two adsorption 
sites (qi and qi). Let v> be the interaction energy when both sites of a pair are 
occupied. Let <ro, <ri, and <22 be the fractions of adsorbent molecules with zero, 
one, and two adsorbed molecules. Let p be the concentration of adsorbate 
molecules in equilibrium with adsorbed molecules. For a dilute solution, from 
thermodynamics, 

M = p*+kTlap, (7-74) 

where p* is independent of p. We define successive thermodynamic dissociation 
constants by 

K(i) = K( 2 ) = 

<T2 


£0P 
<ri 
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Show that the average number 5 of adsorbed molecules per pair is 

. WK( 2>] + [2p 2 /g (1) K (2) ] 

1 + (p/E(2)] + [P 2 /E(l)E(2)] " 

This is a purely thermodynamic result. Next, we define an “intrinsic” dis¬ 
sociation constant Ki for sites of type 1 by 


_ p(i - e°i) 
*? ’ 


9 iX 

1 + giX 


(7-76) 


These equations refer to adsorption on isolated sites of type 1. Show that Eqs. 
(7-74) and (7-76) give 

-**lkT 

Ki-*——, K 2 - - - (7-77) 

gl ?2 

Using this notation, show that the statistical-mechanical equation (7-33) becomes 

a [(1/Ki) + (l/g 2 )]p-K2e-" a V/g 1 g 2 ) 

1 + [(1 /Ki) + (l/ffa)]p + (e-vinpi/KiKs )' U 1 

Finally, compare Eqs. (7-75) and (7-78) to deduce the connections 


(Ei + K 2 )e'""‘ T , 


J_ = JL + JL. 

E<2) Ki ^ Ki 


Give a physical interpretation of Eqs. (7-79). (Page 139.) 

7-9. For a system of adsorbent molecules each of which has m equivalent 
and independent sites for adsorption, show that { — (1 + q\) m and that the 
adsorption isotherm is the same as the Langmuir equation, (7-31). (Page 140.) 

7-10- Use the method of Eq. (1-92) to find the characteristic thermodynamic 
function for the partition function Y. Use this result and thermodynamics to 
show that 



(Page 141.) 

7-11. Derive Eq. (7-72) for the length-force relation of an adsorbing polymer 
chain from the canonical partition function Q(M a , M, N, T). (Page 142.) 

7-12. Derive the properties of an ideal lattice gas using the N, 4>, T partition 
function, Eq. (1-87). 

7-13. Deduce, from Eq. (7-6), the thermodynamic equation for a lattice gas 
(or adsorbed phase) 

#(p) = kT ( 6(j/) d In jf (T constant), 

Jo 


assuming that the equilibrium gas phase (pressure p) is ideal. Use this equation 
to check the self-consistency of Eqs. (7-10) and (7-15). 
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7-14. If adsorption in the B.E.T. theory is limited to n layers (by pore struc¬ 
ture in the adsorbent, for example), show that 

5 fgjl — (» + l)s" + na: lt+1 ] 

(1 — x)(l — x + ex — cx n+1 ) 

7-15. Derive the B.E.T. equation, (7-36), from the complete grand partition 
function 3. For a given N, let Ni be the number of molecules in the first layer 
and N' be the number in higher layers, where Ni + IV' = N. Then Q(N, M, T) 
will involve a sum over either N i or N'. 

7-16. Show that 77 = X(d In T/dX) in Eq. (7-51) follows from fclT In T = 
pV - p'M and Eq. (7-46). 

7-17. Discuss the equilibrium between gas and adsorbed phase (Langmuir 
model) in terms of an energy-entropy competition (see Section 5-3). 


Supplementary Reading 
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S. M., Chapter 7 and Appendixes 4 and 5. 
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CHAPTER 8 
IDEAL DIATOMIC GAS 

In Chapter 4 we studied an ideal monatomic gas. We extend the dis¬ 
cussion of ideal gases here to those composed of diatomic molecules, and 
then in Chapter 9 we consider polyatomic molecules. An important ap¬ 
plication of the results obtained in these two chapters is to chemical equi¬ 
librium and chemical kinetics in ideal gas mixtures (Chapters 10 and 11). 

8-1 Independence of degrees of freedom. We shall present in this 
chapter and the following one the simplest possible treatments, based on 
the rather good approximation of independence of degrees of freedom. 
Brief mention of this approximation has already been made in Chapter 3 
[see Eqs. (3-13) and (3-14)], but we discuss it a little more fully here. 

Consider a diatomic molecule consisting of two nuclei (masses mi and m 2 ) 
and several electrons. In an exact quantum-mechanical treatment of this 
system, all these particles and their (coulombic) interactions have to be 
included in a single Schrddinger equation. However, because the electrons 
are very light compared with the nuclei and hence move much faster on 
the average, it is an excellent approximation (called the Bom-Oppenheimer 
approximation) to consider the nuclei fixed while studying the electronic 
motion. The procedure, then, is the following. For nuclei fixed at some 
distance apart r, we write and solve the Schrddinger equation in the 
coordinates of the electrons. The potential energy in this equation is made 
up of electron-electron, electron-nucleus, and nucleus-nucleus coulombic 
interactions. The last of these interactions is of course cpnstant for con¬ 
stant r. A set of (electronic) energy levels is obtained, which will depend 
on the value of r. In particular, let the dependence of the ground state 
electronic energy on internuclear distance be denoted by it,(r). This 
function can be calculated accurately only for very simple molecules (one 
and two electrons). In more complicated cases it must be approximated 
or deduced semiempirically from experimental information. The typical 
form of ««(r) is shown in Fig. 8-1. It is qualitatively similar in appearance 
to the van der Waals interaction curve, Fig. IV-1, but is much deeper 
(of the order of 100 kcal-mole -1 ), and the minimum occurs at smaller 
values of r (often 1 or 2 A). The depth of the u ,(r) well is of course essen¬ 
tially the energy of the chemical bond between the two atoms. The dashed 
curve in Fig. 8-1 is a schematic plot of the first excited electronic energy 
level as a function of r. 
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0 


*- D, + |Ai> 

'^-D. + ihr 


Fig. 8-1. Ground (and first excited) state electronic energy u, as a function 
of internuclear separation r. The first few vibrational levels are shown. 


We can now investigate the (relatively slow) motion of the two nuclei. 
The work required to change the value of r when the molecule is in its 
ground electronic state is determined by «,(r). That is, u e (r) serves as the 
potential function for the nuclear motion. A second Schrodinger equation 
is then written, this time in the six coordinates (x x , y u z it x 2 , V 2 , 22 ) of 
the two nuclei with u«(r) as the potential function. Since we want to 
consider the molecule to be in a box of volume V, we must add that the 
potential becomes infinite when the center of mass of the molecule is out¬ 
side this box. The next step is to change to the coordinates x, y, z, r, 0, <p, 
where x, y, and 2 refer to the position of the center of mass and r, 0, and <p 
are indicated in Fig. 8-2. With these new coordinates, the equation 
separates into two equations, one in x, y, and 2 (translation) and one in 
r (vibration), 0 and <p (rotation). The x, y, z equation is just that of a 
particle of mass + m 2 in a box of volume V. This problem has al¬ 
ready been discussed in Section 4-1. We can take over the results ob¬ 
tained there simply by replacing m by mi + m 2 . The energy levels from 
the r, 0, <p equation are difficult to calculate [for one thing, u e (r) is not 
a simple function] and are not simply additive (vibration + rotation). 
Therefore we make two further approximations. First, we replace u e (r) 
by a parabola which fits u«(r) in the neighborhood of its minimum. Second, 
we separate the vibrational and rotational motions from each other by 
assuming that, as far as rotation (0, <p) is concerned, the molecule has a 
fixed internuclear distance r«, where r e is the (“equilibrium”) value of r 
at the minimum in u,(r). The rotational motion is therefore approximated 
as that of a rigid, linear, free rotator and the vibrational motion as that 
of a simple harmonic oscillator (a problem already discussed in Sec¬ 
tion 5-1). 
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Fig. 8-2. Vibrational (r) and rotational (0, <p) coordinates of a diatomic 
molecule. The center of mass is at the origin. 


We have then the basic equations [see (8-13) and (3-14)] for an ideal 
diatomic gas, 

H = H, + H r + H v + H e , (8-1) 

« = e, + e r + *» + ««, (8-2) 

Q = q < V ’ T) = QdV, Dqr(T)q,(Dq.(D. (8-3) 


The rotational, vibrational, and electronic states are obviously inde¬ 
pendent of the volume, so the corresponding g's are functions of T only. 
We shall discuss q v and q r in Sections 8-2 and 8-3, respectively. For 
q t , as already mentioned, we have from Eq. (4-10), 


qt 


r 27r(mi + m 2 )fcT , l 3/2 T7 

L ] r - 


(8-4) 


In the notation of Eq. (4-42), q, is simply 


q, — w.ie 


-UlIkT 


(8-5) 


We shall usually choose the zero of energy as the separated atoms (r —* oo 
in Fig. 8-1) at rest. If the vibrational energy levels are denoted by e n , 
relative to the customary choice of the bottom of the potential energy well 
u,(r) as zero, and if the depth of the well (Fig. 8-1) is D,(> 0), then 
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q t q„ relative to our choice of separated atoms as zero is 

M. = «.i £ = c el e D -'” 

» » 

q> 

Thus it is self-consistent to put e, 1 = — D, in Eq. (8-5), as we shall do, 
and write q v (in Section 8-2) relative to the bottom of the well as zero. 

Ordinarily, just as for a monatomic gas, it is only the ground electronic 
state which is of interest because of the large separation between electronic 
energy levels, relative to kT. However, there are exceptions; for example, 
NO (above 15°K) and O 2 (above 1000°K). In these cases, assuming as 
a further approximation that the rotational and vibrational states are the 
same in the different electronic states, q, is still a separate factor in Eq. 
(8-3), and we write 

q. = u> tl e~ tnlkT + a e2 e^" lkT . (8-6) 

As a consequence of Eq. (8-3), many of the thermodynamic functions 
receive additive contributions from translation, rotation, etc. For example, 

A = —kT]nQ = A t + A r + A v + A„ (8-7) 

where 

A t - NkT In . (8-8) 

A r — —NkT In q r , A v = —NkT In q v , etc. (8-9) 
Equation (8-8) is the same as Eq. (4-14), except for the mass. Also, 

E = kT 2 (^) jv r = E t + E r + B, + E t , (8-10) 

where 

E t = iNkT, (8-11) 

E r = NkT 2 , etc. (8-12) 

Similar equations can be written for Cv, S, etc. 

The equation of state, 



does not depend on q r (T), q v (T), etc. [see also Eq. (3-12) and Problems 
3—1 and 3-6]. 

To continue the discussion, we need further details concerning q v and q r . 



8—2J 


VIBRATION 


151 


8-2 Vibration. The vibrational motion of a diatomic molecule involves 
only one coordinate, the intemuclear distance r. Thus, there is only one 
vibrational degree of freedom. For the potential energy that determines 
the vibrational motion, we take 

u«(r) = —D t + if(r — r e ) 2 , (8-14) 

as already explained, where / is the force constant, 



(Problem 8-1). Since we wish to use the bottom of the potential well as 
the zero of vibrational energy, we drop D, in Eq. (8-14) and include it 
instead in Eq. (8-5) for q„ as mentioned earlier. 

Let the line between centers of the two nuclei be the f-axis, with arbi¬ 
trary origin and, say, f 2 (location of nucleus of mass m 2 ) >fi. Then 
r = f 2 — fi, and 

«. = i/(fa — f i — r,) 2 . 

The classical equations of motion are 


wifi = — = /(fa — fi — r«), 

Wafa = — = —/(fa — fi — »"«)• 


We multiply the top equation by — w 2 , the bottom by m lt add the two 
equations, and divide by w t + w 2 . Then 


where 


M2 = -Jx, 


x = r - r„ 


WjW 2 

Wi + w 2 


(8-16) 

(8-17) 


This is the equation of motion of a one-dimensional harmonic oscillator 
with mass fi (called the “reduced mass”). This is the standard symbol 
used, and is of course not to be confused with the chemical potential. The 
frequency of the classical motion is 

-=<*-”» 

In quantum mechanics, the energy states are 

«» = (» +- i)hv, n = 0,1,2,..., 
where v is the classical frequency, (8-18). 


(8-19) 
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The vibrational partition function is therefore (Eq. 5-6) 


Qv = 


9,I2T 


1 - 


9.IT 


( 8 - 20 ) 


where 0„ = hv/k. The vibrational contributions E v , Cm, S v , etc., to the 
thermodynamic functions of an ideal diatomic gas are just those of N one¬ 
dimensional harmonic oscillators all of the same frequency v. These are 
the same functions as for an Einstein crystal (3 N one-dimensional oscil¬ 
lators), except for a factor of three. For example, 


and 



( 8 - 21 ) 

( 8 - 22 ) 


A law of “corresponding states” obviously applies, since Cw/Nk is a 
function of O v /T only. 

Although the Einstein equations are applicable, the values of v and 
therefore 0. are in general higher here by a factor of about ten because of 
the fact that chemical bonds are relatively strong (D e ) and stiff (/) relative 
to van der Waals and other types of “bonds” in most crystals. In prin¬ 
ciple, u,(r) and therefore /, v, and 0* can be calculated from quantum 
mechanics. But in practice, these quantities are usually deduced from the 
experimental vibrational spectrum. Table 8-1 gives a few values of 0, 
and 0 r (defined in Section 8-3). 

To take two specific numerical examples, the value 0„ = 3340°K 
for N 2 corresponds to v = 6.96 X 10 1S sec -1 . At 300°K, ©„/T = 
hv/kT = 11.1. Hence the fraction of molecules in excited vibrational 
states (n > 0) at this temperature is 




1 - 




= e*' ,T =e ~ ni . 


This order of magnitude is typical for most diatomic molecules, as is clear 
from Table 8-1. That is, at room temperature, the vibrational degree of 
freedom is practically “unexcited” and makes only a very small contri¬ 
bution to CV, etc. (Problem 8-2). On the other hand, a few molecules 
have relatively loose vibrations and small values of 0„. For example, 
0®/T = 1.57 for Br 2 at 300°K, and the fraction of Br 2 molecules in 
excited vibrational states at this temperature is 0.21. 

The vibrational contribution to the heat capacity CV follows the Ein¬ 
stein curve in Fig. 5-2 and ranges from zero at low temperatures to Nk 



8-3] 


ROTATION 


153 


Table 8-1 

Parameters for Diatomic Molecules 



e.,°K 

e n °K 

r«, A 

Do, ev 

h 2 

6210 

85.4 

0.740 

4.454 

n 2 

3340 

2.86 

1.095 

9.76 

02 

2230 

2.07 

1.204 

5.08 

CO 

3070 

2.77 

1.128 

9.14 

NO 

2690 

2.42 

1.150 

5.29 

HC1 

4140 

15.2 

1.275 

4.43 

HBr 

3700 

12.1 

1.414 

3.60 

HI 

3200 

9.0 

1.604 

2.75 

Cl 2 

810 

0.346 

1.989 

2.48 

Br 2 

470 

0.116 

2.284 

1.97 

Is 

310 

0.054 

2.667 

1.54 


(i.e., R per mole) at high temperatures. The upper limit is predicted by 
classical statistical mechanics (see Problem 6-6). The value O.INk for 
Gw is reached at T = 8„/5.8, 0.5 Nk at I T = 0»/3.O, and 0.9 Nk at 
T — 0„/l.l. In the Br 2 example above (at 300°K), Cw — 0.&2Nk. 

In Table 8-1 and Fig. 8-1, Do is the dissociation energy of the diatomic 
molecule at 0°K. The relation to D e is 

D 0 = D, — \hv. 

Although D, cannot be measured by a direct experiment, it may be 
calculated from experimental values of D 0 and v. 


8-3 Rotation. The two ends of a symmetrical diatomic molecule are 
indistinguishable, just as two monatomic molecules of the same species 
in a gas are indistinguishable. This feature leads to symmetry complica¬ 
tions and the involvement of nuclear spin in a quantum-statistical treat¬ 
ment of the rotation of these molecules. We postpone such a treatment 
until Chapter 22. Fortunately, as we shall see below, a quantum treat¬ 
ment is really necessary only for very light molecules (e.g., hydrogen) at 
low temperatures. Hence, the postponement referred to is not a serious 
limitation. 

These complications do not arise with unsymmetrical diatomic mole¬ 
cules. Hence we can begin with a discussion of the quantum-mechanical 
q, for such molecules. In quantum mechanics one finds that the energy 
levels of a rigid linear rotator are 


«/ 


Kj +1 )h 2 

~ 8ir 2 7 ’ 


j = 0,1,2,..., 
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with degeneracy wy = 2j + 1, and where 7 is the moment of inertia 
about the center of mass. For a diatomic molecule, I — pr, 2 (Problem 
8-3). Therefore, 

q r (T) = Y^ 03 >e~ <,lkT = Z ( 2 i + l)e~ Hi+l) *’ IT , (8-23) 

i i-0 


where 


r St 2 Ik 


(8-24) 


In Eq. 8-24, 9 r is the characteristic temperature for rotation. The sepa¬ 
ration Ac between successive rotational levels, relative to kT, is 0 r /T 
multiplied by a simple function of j. Hence the sum in Eq. (8-23) may be 
replaced by an integral when T » 0 r . This will lead to the high- 
temperature or classical limit of q r . Table 8-1 includes values of 0 r for a 
number of diatomic gases. It will be seen from the table that for ordinary 
gases at ordinary temperatures or even fairly low temperatures, T is 
indeed much larger than 0 r . In principle again, r„ and hence 7, may be 
calculated from quantum mechanics (Fig. 8-1), but in practice 0 r , 7, 
and r„ are deduced from the experimental vibrational-rotational or pure 
rotational spectrum. 

The sum in Eq. (8-23) cannot be put in closed form. However, it is 
easy to use numerically, as it stands, at low temperatures (Problem 8-4). 
At high temperatures, 

qr -*• r (.2j + l)e~ Hi+m ' IT dj 

JO 

= f* e~ M+l » rlT <Hj(j + 1)] 

Jo 


T 8iSlkT 

©r h 2 


(8-25) 


When T is not quite high enough* for use of Eq. (8-25), 

* = 4;( l+ 5?+'--)- <8 - 26) 

The corresponding equation for q v , incidentally, is given in Problem 6-2. 
As a further check on the validity of Eq. (6-14) for q 0 i ua , we have 

+« 2t r 

g,i». = p JJ f fe- H ' lkT d6d<pdp,dp„ (8-27) 

—00 0 0 


* Mayer and Mayer, pp. 151-154. 



8-31 


ROTATION 


155 


where (Problem 8-5) 




2 \ 
sin 2 ej 


(8-28) 


On carrying out the integrations (Problem 8-6), we get Eq. (8-25) again. 

The expression in Eq. (8-25) is also the classical partition function of a 
particle of mass /x moving freely on the surface of a sphere of radius r, 
[see Eq. (7-17)]: 


2tijJcT 

*' = ->w~ 


(4rr?) = 


8 iflkT 
h* 


The above discussion pertains to unsymmetrical diatomic molecules. 
The principal conclusion we reach, in view of Table 8-1, is that we can 
generally regard rotation as classical—or fully “excited,” as it is some¬ 
times put. Table 8-1 also contains 6 r values for symmetrical molecules. 
Except for hydrogen (and deuterium), these are of the same order of 
magnitude as for unsymmetrical molecules. Hence the same conclusion: 
generally rotation of symmetrical molecules can be treated classically. 
We may therefore apply Eq. (8-27) to symmetrical molecules also, but 
with one correction. For exactly the same reason that division by N\ is 
included in Eq. (6-18), we have to divide here by two if the diatomic 
molecule is symmetrical. If we do not do this, the two indistinguishable 
configurations in Fig. 8-3 will both be counted in the integration over 6 
and <p in Eq. (8-27) [see the discussion following Eq. (6-18)]. 

Thus we can write for the high-temperature (classical) q r . 


T &K 2 IkT 
9r ~ <tB t ~ oh* ’ 


(8-29) 


where o = 1 for unsymmetrical molecules and o — 2 for symmetrical 
ones. The constant o is called the symmetry number. 



Fio. 8-3. Two indistinguishable configurations of a symmetrical diatomic 
molecule. 
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T/Qr 

Fig. 8-4. Rotational heat capacity of an unsymmetrical diatomic molecule. 

Equation (8-29) leads to very simple rotational contributions to thermo¬ 
dynamic functions. For example, 


Exj® 

^ 1 fe, 

-S 

E« 

£ 

1 

II 

(8-30) 

Er = NkT 2 dl j T 9r = NkT, 

(8-31) 


(8-32) 

- (?w\ - 

(8-33) 


Equations (8-31) and (8-33) could have been predicted immediately 
from Eq. (8-28) for H r and from Problem 6-6. There are two rotational 
degrees of freedom (6, <p), and E r represents kinetic energy only (unlike 
E v , which has equal kinetic and potential contributions). Incidentally, 
the rotation of a diatomic molecule in an imperfect gas, liquid or solid, 
does involve a potential energy and is therefore not “free” rotation, as in 
the present situation (ideal gas). 

We can also derive low-temperature rotational contributions to thermo¬ 
dynamic functions for unsymmetrical molecules directly from the sum 
(8-23). For example, Fig. 8-4 gives Cvr/Nk as a function of T/9,. 
Again we have a law of corresponding states. The heat capacity has 
practically its classical value for T > 1.50 r . An important example is 
HD(0 r = 64°K), for which the theoretical Crr curve, including the 
maximum, has been confirmed experimentally (Problem 8-7). 

8-4 Thermodynamic functions. Our main object in this section is to 
bring together the various contributions to the thermodynamic functions 
of an ideal diatomic gas. We shall set down complete equations only for 
the most important special case: translation and rotation treated classi- 
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cally; vibration treated quantum-mechanically; and electronic degrees of 
freedom unexcited (i.e., only the ground electronic state included). Be¬ 
fore writing the equations, we emphasize again that (a) the present 
analysis, in which the various contributions are additive, is somewhat 
approximate; and (b) the question of classical (fully excited) vs. quantum 
vs. unexcited (i.e., ground state only) for a particular type of degree of 
freedom is determined by the magnitude of the appropriate energy-level 
separation relative to kT. In typical cases (Problem 8-8): 

Ac (translation) = O(10 -18 ev), Ae (vibration) = 0(0.3 ev), 

Ae (rotation) = 0(5 X 10 -4 ev), Ac (electronic) = 0(5 ev), 

fcr(300°K) = 0(0.03 ev). 


The criteria are: 

Classical: Ac « kT, 

Quantum: Ae = 0(kT), 

Unexcited: Ac » kT, 


© « T. 

0 = 0(T). 
©» T. 


In the special case referred to in the above paragraph, we find from 
Eq. (8-7), etc., 


A . r2ir(wi 1 + m 2 )fcr] 3/2 Ve , , 8r 2 IkT 

NkT^H -F-J N- + ln ~^hT- 

~ ^T ~ ln (1 ” e ' k, " kT) + ^ + ** «.i5 

E 3.2. hv , hv/kT D, 

NkT 2 " l ~ 2 "*■ 2kT e^itr — i kT ’ 


Gy _ 
Nk ~ 

A. 

Nk ~ 


3+i + (A) 

2^2^ \kT/ 


e 


,h*lkT 


[ehoIkT — 1)2 » 

[j 3 


+ 


2jt(»»i + m 2 )kT\ 312 Ve m . , 8ir 2 IkTe 
- ~W + hl ~ah2~ 

hv/kT 


ehrlkT 


j — In (1 — e~ hr,kT ) + ln w el ; 


pV = NkT. 


(8-34) 

(8-35) 

(8-36) 

(8-37) 

(8-38) 


Ordinarily, = 1. The molecules 0 2 and NO are exceptions (see 
below). The order of terms in these equations is: translation; rotation; 
vibration; and electronic. The heat capacity Cy is thus 5Nk/2 (transla¬ 
tion, rotation) for T « ©„ = hv/k and rises to 7Nk/2 (vibration classical) 
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for T » 0*. The form of the Cv curve between these two limits (Einstein 
curve in Fig. 5-2) has been verified experimentally in a number of cases. 
At very low temperatures, rotation is no longer classical, and Cv falls from 
5Nk/2 to ZNk/2 as the temperature decreases. It is not practical to 
observe this effect experimentally, however, except for very light gases 
(e.g., HD). Figure 8-4 (unsymmetrical molecules) gives Cvr for HD if 
we put 0, = 64°K. For HD, 0„ is 6100°K, so that the two observable 
transitions (owing to exciting rotation and vibration) in Cv from 3 Nk/2 
to bNk/2 to lNk/2 are very widely separated on the temperature scale. 

For the chemical potential, we obtain [see the remarks in connection 
with Eq. (4-26)] 


f _ ±_ , pv_ _ jl_ **°(r) . In _ 

NkT ~ NkT r NkT ~ kT ~ kT ^ p ’ 


(8-39) 


where 


M°(P 

kT ~ 





+ In (1 - e~ h " kT ) - ^ - In (8-40) 


In the above equations the zero of energy has been chosen as the sepa¬ 
rated atoms at rest, and the zero of entropy corresponds to a single (G = 1) 
translation-rotation-vibration-electronic quantum state for the macro¬ 
scopic system of N diatomic molecules, i.e., the pure crystal at 0°K. 

We have already referred briefly to the agreement found between experi¬ 
mental Cv measurements and the above theory. Experimental checks of 
the entropy equation, (8-37), will be mentioned in the next chapter. In 
addition, Eq. (8-39) for the chemical potential has been verified by 
measurements of crystal vapor pressures* [compare Eq. (5-50)] and experi¬ 
mental gaseous equilibrium constants (Chapter 10). 

For NO and 0 2 we must usef Eq. (8-6) for q e : 

q , = e D ' lkT [o> el + W(2 e-“‘>-‘‘ l)/ * 7 ’], (8-41) 

NO; = 2, w e2 = 2, (e e2 — c«i )/k = 178°K, 

0 2 : <a,i — 3, U|2 = 2, (e <2 — €«i )/k — 11,300°K. 

Equation (8-41) leads to a significant electronic contribution to Cv for 
NO, with a maximum predicted (Problem 8-9) at 74°K. For a con- 


* See Fowler and Guggenheim, pp. 202-205. 
t See Fowler and Guggenheim, pp. 102-106, for further details. 
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siderable temperature range (see Table 8-1) on both sides of 74°K, 
Cvt + Cvr = 5Nk/2 and Cw = 0. The experimental variation of Cv 
with T in this region is therefore due to Cy«. Theory and experiment have 
been checked between 125°K and 180°K. 

Finally, it should be mentioned that instead of deriving thermodynamic 
functions from the slightly idealized energy levels of a simple harmonic 
oscillator and rigid rotator, one can use instead actual energy levels de¬ 
duced from spectroscopy. This is obviously a more accurate procedure, 
but one which we shall not discuss. The approximation we have used in 
this chapter takes care very adequately of all the first-order effects. 

Problems 

'8-1. The empirical Morse function for u t (r) is 

«,(r) - D.{[1 - - 1}. 

Find the force constant f in terms of D, and a. (Page 151.) 

8-2. Calculate CV, in cal-molc _1 -deg _1 for N 2 and CI 2 at 300°K, 900°K, 
and 2700°K. (Page 152.) 

8-3. From the definitions of center of mass and moment of inertia, show 
that I = prf for a diatomic molecule. (Page 154.) 

8-4. For HD gas at 96°K, use Eq. (8-23) to calculate q„ E, (cal-mole -1 ). 
and Cvr (cal-mole -1 -deg _1 ). Take ©, «■ 64°K. Compare this value of q, with 
that obtained from the classical q r . What arc the values of E in cal-mole -1 
and C, in cal*mole -1 -deg -1 for HD at 96°K? Also, find the fraction of HD 
molecules in each of the first four rotational energy levels at 32°K, 96°K, and 
256°K. (Page 154.) 

8-5. In the free rotation of a diatomic molecule with center of mass fixed 
at the origin, each nucleus moves over the surface of a sphere. Express the 
kinetic energy of the two nuclei in cartesian coordinates, change to spherical 
coordinates, and hence show that the kinetic energy is 

+ 4? sin 2 6). 

From this, derive Eq. (8-28). Show that a single particle of mass n moving on 
a sphere of radius r, has this same kinetic energy. (Page 155.) 

8-6. Derive the result q r = T/9 r from the classical phase integral, Eq. 
(8-27). (Page 155.) 

-8-7. Deduce the value of 0 r for HD from 9 r = 85.4°K for H 2 . (Page 156.) 
8-8. Verify the orders of magnitude listed near the beginning of Section 8-4 
for Ae (translation, rotation, etc.) What wavelengths of light in A and wave 
numbers in cm -1 do these energies correspond to (At = hv — hc/\)1 (Page 157.) 
- 8-9. With the aid of Problem 4-8, verify that CV« for NO has a maximum 
at about 74°K. (Page 158.) 

>-8-10. From spectroscopy it is found that 0, = 9.0°K for HI. Calculate, 
(a) the moment of inertia of HI in cgs units, and (b) the equilibrium inter- 
nuclear separation r, of HI in A. 
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8-11. Calculate the fraction of CI 2 molecules in the first four vibrational 
states at 200°K, 800°K, and 3000 # K. 

8-12. From Table 8-1 calculate the vibrational force constant/in dyne-cm -1 , 
the frequency v, and wave number in cm -1 for HC1, CI 2 , and I 2 . 

8-13. From Table 8-1 calculate D, in cal-mole -1 for H 2 . 

8-14. Calculate S and C p in cal-mole _1 -deg _1 and p in cal-mole _1 for N 2 
and HBr at 25°C and 1 atm pressure (see Table 9-2). The value of «,i is unity 
for both gases. 

8-15. Calculate the work (in cal-mole -1 ) necessary to stretch the chemical 
bond in HC1 from r = r, to r = r, + 0.1 A. 


Supplementary Reading 
Fowler and Guggenheim, Chapter 3. 

Herzberg, G., Molecular Spectra and Molecular Structure. 2nd ed. New 
York: Prentice-Hall, 1950. 

Mayer and Mayer, Chapters 6 and 7. 

Rushbrooke, Chapters 6 and 8. 
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CHAPTER 9 

IDEAL POLYATOMIC GAS 

In this chapter we make a direct extension to polyatomic molecules of 
the methods and approximations used in the previous chapter for diatomic 
molecules. One new feature arises: hindered internal rotation (in molecules 
such as ethane), which is considered in Section 9-5. We also include in 
this chapter a section (9-6) on the transition from localized to mobile 
adsorption (these terms were introduced in Section 7-1). Offhand this 
appears to be a completely unrelated subject, but actually this transition 
can be described by the same equations as those developed for the tor¬ 
sional oscillation—>free internal rotation transition in ethane, etc. 

9-1 Potential energy surface. A polyatomic molecule consists of three 
or more nuclei and, usually, many electrons. To deduce the thermo¬ 
dynamic properties of an ideal (i.e., very dilute) polyatomic gas, we first 
need the quantum-mechanical energy levels for a single polyatomic mole¬ 
cule in a box of volume V. This problem is extremely, if not hopelessly, 
complicated unless we make the same simplifications as in the previous 
chapter for diatomic molecules. Actually, there are several refinements 
that could have been introduced without much complication in our treat¬ 
ment of diatomic molecules, but such refinements are not practical here 
because of the greater complexity of the problem. 

For a given fixed configuration of the nuclei of one molecule, we first 
study the quantum-mechanical problem of the motion of the electrons. 
This leads in principle to a set of electronic energy levels of which only 
the ground state is of much interest to us. If we vary the location of the 
nuclei relative to each other, the ground state electronic energy u, varies. 
The (relative) nuclear configuration of a diatomic molecule can be repre¬ 
sented by a single variable r, the intemuclear distance. One can then 
use a curve (Fig. 8-1) to show how u, depends on relative nuclear con¬ 
figuration. But for a polyatomic molecule, u» is a function of 3n — 5 
variables for a linear molecule, or 3n — 6 variables for a nonlinear mole¬ 
cule, where n is the number of nuclei in the molecule. This follows because, 
of the total of 3n coordinates needed to locate the n nuclei in space, three 
are used up on the position of the center of mass of the molecule and 
two (linear) or three (nonlinear) on the rotational orientation of the 
whole molecule in space. The remainder are the coordinates (vibra¬ 
tional degrees of freedom) having to do with the position of the nuclei 
relative to each other and are the only ones on which u e depends. Thus if 
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n > 3, u e is a function of at least three variables. We have to imagine 
then that w« is represented by a surface in a space of at least four dimen¬ 
sions. If the molecule is a stable molecule, this u t surface necessarily has a 
minimum (in some cases, several equivalent minima). The nuclear con¬ 
figuration corresponding to the minimum in u, is the stable configuration 
of the molecule (r = r, in Fig. 8-1). 

The ground state electronic energy u e is used as the potential function 
for the nuclear motion (Born-Oppenheimer approximation). Hence the 
it, surface is often called a potential surface. An exact calculation of the 
potential surface from quantum mechanics is too difficult mathematically, 
so an approximation, or more often, experimental information, is used to 
construct it (near the minimum). 

The translational motion (3 degrees of freedom) of the center of mass of 
the molecule is separable and again leads to (Eq. 8-4) 

qt = ] 3/ V, (9-1) 

where the sum is over all atoms in the molecule. 

The rotational motion (two or three degrees of freedom) is made separa¬ 
ble by assuming that, as far as rotation is concerned, the molecule has the 
rigid structure corresponding to the minimum in the potential surface. 

An analysis of the vibrational motion (3n — 5 or 3n — 6 degrees of 
freedom) is rendered practical by use of a quadratic approximation [see 
for example, Eq. (V—1)] to the true potential surface at the minimum. 
Even with this approximation, a normal coordinate analysis of the vibra¬ 
tional motion is still required. 

Because of the large separation in electronic energy levels, we shall 
be concerned with only the ground electronic state. Hence 


qe = (9-2) 

For ordinary chemically saturated molecules, o» e i = 1. Probably the 
most convenient choice of the zero of energy is completely separated atoms 
at rest. If the depth of the potential minimum is D, (>0) relative to this 
reference point, we put = — D e , and choose the bottom of the potential 
surface as the zero of vibrational energy. 

Having carried over the approximations of Section 8-1 to polyatomic 
molecules, we can apply Eqs. (8-1) through (8-3) and (8-7) through (8-13) 
here without modification. 

Potential energy surfaces are also discussed in Section 11-1. 

9-2 Vibration. Let us assign cartesian coordinates to each nucleus in 
the molecule, xi, y u Z \,..., x n , y„, z„, using as respective origins the posi- 
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tions of the nuclei when the molecule is in its stable or equilibrium (bottom 
of u e ) configuration. For small vibrations about the equilibrium con¬ 
figuration, the Hamiltonian will have the form of Eq. (V-l), but extended 
to three dimensions. The potential energy u t (relative to the minimum) 
will be a quadratic function of relative coordinates such as x 2 — * 1 , 
%z — xi, y 2 — yu etc. To make further progress we must perform a 
normal coordinate analysis as in Appendix V. This can be carried out for 
fairly complicated molecules. Symmetry in the molecular structure and 
simplifications in the assumed form of u e help. Of the 3n normal co¬ 
ordinates, five (linear) or six (nonlinear) will turn out to be associated 
with motion of zero frequency, that is, with translation or rotation. The 
remaining 3» — 5 or 3» — 6 normal coordinates are connected with vi¬ 
bration (except for internal rotation—a complication discussed in Section 


9-5 and not present in the molecules under consideration up to that 
section). 

Corresponding to the diagrams (V-10) and (V-ll) of normal modes 
in one dimension, the following are simple schematic examples in three 
dimensions: 

C0 2 

—© © 

©— 

d 

HjjO 

•O 

oo 

o 



Cf Q 

O © 

© © 

©1 

>-Same v 

© J 


? 

t) 

cf 


The + and — signs refer to displacements perpendicular to the paper. 

Each normal mode of vibration makes an independent contribution to 
thermodynamic functions such as E, Cy, S, etc., as we have already seen 
in Section 5-2 in discussing crystals. Incidentally, we remind the reader 
that the vibration problems in molecules and in crystals do not differ at 
all in principle. In practice, (a) the force constants in molecules are 
usually larger, and (b) the number of atoms in crystals is so large that we 
can ignore (i) translational and rotational degrees of freedom and (ii) 
edge effects. 

If the normal frequencies v u v 2 ,..., v n >, where «' = 3n — 5 or 
3» — 6, are known, we have immediately (see Sections 5-2 and 8-2) 
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equations such as 

n' g-6»/2r 



9v "El- c-e*/r ’ 

(9-3) 

E v 

-«£(*•+,., ?L,)' 

(9-4) 


«—1 


Cw 

- “£[(?') (*//-!,*]■ 

(9-5) 


where 0,- = hvi/k. 

In principle, u„ and hence the v it can be calculated by quantum me¬ 
chanics. In practice, the iq for polyatomic molecules must always be 
deduced empirically from vibrational (infrared and Raman) spectra. 

Numerical examples will be included in Section 9-4. 

9-3 Rotation. If the equilibrium configuration (minimum in w,) of a 
polyatomic molecule is linear, as for example in CO 2 , C 2 H 2 , N 2 0, etc., 
the rotational problem is exactly that already discussed in Section 8-3. 
The molecules are heavier here, the moments of inertia (about the center 
of mass) are larger, and the classical equations are practically always 
appropriate. Thus we take over Eqs. (8-29) through (8-33) without 
change. In the examples mentioned above, a = 2 for the symmetrical 
molecules C0 2 and C 2 H 2 , but <r = 1 for N 2 0 (structure: NNO). 

If the equilibrium structure is nonlinear, three rotational coordinates 
instead of two are required. For example, in addition to two angles, say 
6 and necessary to fix the orientation of an axis in space (as for a linear 
molecule), a third angle is also needed because of the possibility of rotation 
of the molecule around the axis itself. 

The rotational properties of a nonlinear rigid body can be expressed 
most simply in terms of the so-called principal moments of inertia, which 
we now define. First, we locate the center of mass of the equilibrium 
configuration of the molecule. To do this, we use the following definition: 
if the center of mass is chosen as origin of cartesian coordinates for all 
nuclei in the molecule, then (Problem 9-1) 

2 m<x< = 12 m ^ i = 12 mfii ~ °- (9-6) 

tit 

Now consider any straight line passing through the center of mass. A 
moment of inertia can be calculated about this line: I = £,»»,• d 2 , 
where di is the perpendicular distance of the mass tm from the line. On 
this line, mark off a distance on both sides of the center of mass propor¬ 
tional to / -1/2 calculated about the line. Now choose other lines through 
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the center of mass, and on each mark off distances proportional to (same 
proportionality constant) 7 -1/2 about the line. The locus of the marks 
is an ellipsoid,* called the ellipsoid of inertia. The longest marked-off 
line is one of the principal axes of the ellipsoid, and the moment of inertia 
about this line (that is, the minimum 7) is called a principal moment of 
inertia. The shortest line is also a principal axis, and the associated 7 
(the maximum 7) is a principal moment. The third principal moment is 
that about the line perpendicular to both the lines just referred to. Let 
us call the three principal moments of inertia I a, Ib, and 7c. If all three 
moments are equal, the molecule is called a spherical top (e.g., CH 4 ); if 
only two are equal, the molecule is a symmetrical top (e.g., CgHg and 
CHCI 3 ); if all three moments are different, the molecule is an unsym- 
metrical top (e.g., H 2 0). 

The rotational energy levels for the most general case (an unsym- 
metrical top) are very complicated. Hence a general quantum-statistical 
treatment is awkward on this account, as well as on account of quantum 
symmetry considerations (as with H 2 ) for some molecules (see Chapter 
22). Fortunately, a classical treatment is legitimate except in the case 
of very light (hydrogen-containing) gases at low temperatures (e.g., CH 4 
below 80°K). We consider the classical case only. Now even the classi¬ 
cal Hamiltonianf for a rigid unsymmetrical rotator is a little too com¬ 
plicated to derive here (since this is not a book on mechanics). We 
confine ourselves, therefore, to the statement that insertion of the clas¬ 
sical Hamiltonian into the classical phase integral leads, after a rather 
straightforward integration, f to (Problem 9-2) 

v 112 /W7AfcTY / V8T 2 7 B A:rY /2 

qr = —\— P - / \—v—J y—jr-) • (9_7) 

As before, the symmetry number <r has been introduced in Eq. (9-7) to 
correct for repeated counting of indistinguishable configurations in the 
classical phase integral. The symmetry number is the number of different 
ways in which the molecule can achieve, by rotation, the same (i.e., 
counting like atoms as indistinguishable) orientation in space. For ex¬ 
ample, we can see that <r = 12 for CH 4 (tetrahedral symmetry) as follows. 
If we imagine, say, that the tetrahedron is sitting on a table with one 
hydrogen up and three down, then there are three ways of arranging the 
“down” hydrogens (rotations of 0°, 120°, 240°) while keeping the same 
hydrogen up. But there are four possibilities for the “up* hydrogen. 


* For proof, see Chapter 5 of H. Goldstein, Classical Mechanics. Reading, 
Mass.: Addison-Wesley, 1950. 
t See Mayer and Mayer, pp. 191-194. 
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Therefore, a = 3 X 4. Similarly, <r = 2 for H 2 0, <r = 3 for NH 3 , 
«r = 4 for C 2 H <( and <r = 12 for CgHg. 

Equation (9-7) can also be written, in obvious notation, 


T i/a ( T a y n 
Qr = ~\0a@b0c/ 


(9-8) 


The rotational contributions to some of the thermodynamic functions 
of nonlinear molecules are then: 


A r = -NkT In 

V V2 / rpS \l/2 

(9-9) 

<r \0 a@b©c/ 

E r = %NkT, 


(9-10) 

Cvr = m> 


(9-11) 

T ,/2 

Cf _ XTL \ n H _ 

( T 3 e 3 ^ 1/2 

(9-12) 

Or — life m- 

<r 

V0A0B0C/ 


Here the rotational (kinetic) energy is ZNkT/2 instead of 2NkT/2, as for 
linear molecules, because of three instead of two rotational degrees of 
freedom. We do not need to know I a, Ib, and Ic to evaluate E r , Cvr, 
etc., but these quantities are required for the entropy and any thermody¬ 
namic function which involves the entropy (e.g., p, A, etc.). The moments 
of inertia can be computed if the molecular structure is known. This 
structure (i.e., the nuclear configuration at the minimum in u,) is too 
difficult to deduce by a quantum-mechanical calculation, so recourse must 
be taken to experimental information (rotational spectrum, electron and 
x-ray diffraction, etc.). 


9-4 Thermodynamic functions. We first bring together the various 
contributions to the mam thermodynamic functions. For linear polyatomic 
molecules, Eqs. (8-34) through (8-38) apply, except that mi + m 2 must 
be replaced by £,• m,-, and each term involving the vibrational frequency 
v must be replaced by a sum of similar terms over 3n — 5 vibrational 
frequencies. For nonlinear molecules, 


A 

NkT 


, n \ 2ir('Z i m i )kT f 2 Ve : 

~ m L A 2 


. 1/2 



— jfC + l n (1 — « hr ' lkT ) j + ^ + In w.i; 


(9-13) 
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Nk 


E 3 . 3 , 3 ^ 6 (to t hVj/kT N _ D. . 

NkT ~ 2 ^ 2 ^ \2fcr - 1/ fcT ’ 

(V 3,3, 3 $F?(hv\ 2 e k ’ ilkT 
iVA 2 ' 2 ' \Jfc77 (eWW - 1)2’ 

, r2F(E. t» ( )]fcr1 3 ' 2 Fe 8/2 , , ir ll2 e 312 ( T 3 Y' 2 

~ ln L m J # + <r Ve A e B ec/ 

+ E* [sT^TT - to (1 - .-«*’•>] + to 


p7 = NkT. 


(9-14) 

(9-15) 


(9-16) 

(9-17) 


The chemical potential follows immediately from p = (A + pV)/N 
(Problem 9-3). 

We now turn to examples of applications of the equations for Cv/Nk. 
Aside from the constants 5/2 (linear) and 6/2 (nonlinear), the only con¬ 
tributions come from the sum over frequencies in Eq. (9-15). We use 
frequencies obtained from infrared and Raman spectra. 

(1) For C0 2 , the four values of 8< = hvi/k are 1890, 3360, 954, and 
954°K. The respective vibrational terms in Eq. (9-15) are then calculated 
to be 0.086, 0.000, 0.483, and 0.483 at 312°K, or a total of 1.05. Hence 
Cv/Nk at 312°K is 2.50 + 1.05 = 3.55. The experimental value is 3.53. 

(2) For BF 8 , the 8< are 1270, 995, 2070, 2070, 631, and 631°K. The 
respective terms in Eq. (9-15), at 278°K, are 0.221,0.378,0.033 (X2), and 
0.662 (X2). The total is 1.99. Hence Cv/Nk is 4.99. The experimental 
value is 4.89. At 189°K, we calculate from the 8< that Cv/Nk = 5.05, 
whereas the experimental value is 5.04. 


Table 9-1 

Heat Capacity of N 2 0 


T, °K 

Cv/Nk 

(calc.) 

Cv/Nk 

(expt.) 

203 

3.06 

3.10 

293 

3.61 

3.61 

390 

4.08 

4.05 

467 

4.49 

4.46 

625 

4.89 

4.90 

733 

5.15 

5.15 
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(3) For N 2 0, the 0,- are 850, 850, 1840, and 3200°K. Table 9-1 com¬ 
pares with experiment, over a considerable temperature range, values of 
Cy calculated from these vibrational frequencies. 

The agreement between theory and experiment in the above examples 
is very good. Furthermore, this kind of agreement is found quite generally. 
However, the above treatment of the vibrational degrees of freedom is 
not adequate for molecules with internal rotation, such as ethane. We 
return to this subject in the next section. 

The entropy provides a somewhat more severe test of the theory 
than the heat capacity, since the rotational and translational con¬ 
tributions to the entropy are not just constants. Table 9-2 compares 
values of the molar entropy at 1 atmosphere pressure for a number of 
gases, obtained in two ways: (a) the “spectroscopic entropy” calculated 
from Eqs. (4-21) (monatomic), (8-37) (diatomic or linear polyatomic), 
or (9-16) (nonlinear polyatomic), using spectroscopic data as the source 
of the moments of inertia, vibrational frequencies, and ground state 
electronic degeneracies; and (b) the “calorimetric entropy” calculated 
from experimental heat capacities and heats of phase transitions. That is, 
we are using in (a) the statistical-mechanical relation <S = fclnQ, and 
in (b) the thermodynamic equation <S = g{T) (Eq. 2-35). Both these 
expressions relate to the same zero of entropy, as explained at the end 
of Section 2-4, and should lead to results which agree with each other. 
This is seen to be the case in Table 9-2. Three comments about this 
table should be made: (1) If the real gas is not effectively ideal at 
1 atm pressure and the temperature indicated, the calorimetric value of 


Table 9-2 

Entropy of Gases at 1 Atmosphere Pressure 


Gas 

;r, 0 K 

&spect> 

cal’deg-^mole -1 

'Scab 

cal*deg - 1 *mole -1 

A 

298.1 

37.0 

36.4 

Cd 

298.1 

40.1 

40.0 

Zn 

298.1 

38.5 

38.4 

Hg 

298.1 

41.8 

41.3 

n 2 

298.1 

45.8 

45.9 

o 2 

298.1 

49.0 

49.1 

HC1 

298.1 

44.6 

44.5 

HBr 

298.1 

47.5 

47.6 

NHa 

239.7 

44.1 

44.1 

C0 2 

194.7 

47.5 

47.6 

CH 3 Br 

276.7 

58.0 

57.9 
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5 has been corrected for gas imperfection by use of equation-of-state 
data (the correction is usually less than 0.3 cal-deg - 1 -mole -1 ). (2) The 
spectroscopic values of 5 for the diatomic molecules are slightly more 
accurate than would follow from Eq. (8-37), since the actual spectroscopic 
vibration-rotation energy levels have been used (see the last paragraph of 
Section 8-4). (3) The spectroscopic method is capable of greater accuracy 
than the calorimetric method, e.g., the deviations in Table 9-2 for A and 
Hg must be attributed to experimental error in the calorimetric work. 

There are, however, a few molecules for which a discrepancy between 
Aspect and Seal is noted. Some of these are included in Table 9-3. In 
every case, 5 0a i < <S ape ct. The explanation for CO, for example, is gen¬ 
erally accepted to be the following. 5 ape ot is calculated from the number 
of quantum states ft of the gas and should therefore be considered the 
true entropy of the gas (relative to ft = 1 as zero). Solid CO, in its true 
equilibrium state at 0°K, would have each CO molecule in some definite 
orientation (corresponding to the lowest possible energy) and 0=1. 
However, the CO molecule is effectively very symmetrical. It has a very 
small dipole moment and is isoelectronic with N 2 . Hence, in actually 
preparing the crystal at low temperatures, a random mixture of the two 
orientations CO and OC, rather than one particular orientation, is frozen 
into the crystal at each molecular position. ThuB the crystal is in a meta¬ 
stable state, and the rate of the process metastable state (mixed orienta¬ 
tion) —> stable state (definite orientation) is negligibly small at low 
temperatures. The metastable crystal has ft = 2 N at 0°K, instead of 
0=1. Since the calorimetric measurements are made on the metastable 
crystal, we have to write [see Eqs. (2-42) and (2-43)] 

S(T) = 5(0) + g(T) = 5(0) + S C ai, 

where S(0) = k In 2^ instead of 5(0) = k In 1 = 0, as usual. If R In 2 
is added to 5 0 ai for CO in Table 9-3, the agreement with S spe ct becomes 
satisfactory. 

Table 9-3 

Entropy of Gases at 1 Atmosphere Pressure 


Gas 

T ,°K 

$*peetf 

caI*deg~ I *moIe"’* 1 

$sali 

cal'deg _1 *raole _1 

‘S'spect $«al 


CO 

298.1 

47.3 

46.2 

i.i 

ft ln2 = 1.4 

NNO 

184.6 

48.5 

47.4 

i.i 

R In 2 = 1.4 

CH 3 D 

99.7 

39.5 

36.7 

2.8 

ft In 4 = 2.7 

h 2 o 

298.1 

45.1 

44.3 

0.8 

ft In (3/2) = 0.8 

d 2 o 

298.1 

46.7 

45.9 

0.8 

ft In (3/2) - 0.8 
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The explanation for NNO is presumed to be the same as for CO. In 
the case of CH3D, there is an obvious four-fold effective symmetry of 
orientation, so that the discrepancy is R In 4. The argument for H 2 0 
and D 2 0 is rather more involved,* and we omit it. The conclusion is that 
the discrepancy should be R In (3/2). In all these cases, satisfactory 
agreement with experiment is achieved after the correction is made. 

The next chapter, on chemical equilibrium in ideal gases, will provide 
tests of our theoretical expressions for the chemical potential of ideal gases 
(monatomic, diatomic, and polyatomic). 

9-5 Hindered internal rotation in ethane. There are many molecules, 
especially hydrocarbons, with internal rotation about bonds. Ethane 
is a relatively simple prototype, and we confine our rather qualitative 
discussion to this case. One of the 18 normal modes of vibration in ethane 
corresponds to a torsional oscillation of one methyl group relative to 
the other. This mode is neither Raman nor infrared active, so direct 
spectroscopic information about the frequency is unobtainable. However, 
if we measure the heat capacity of ethane and subtract the contributions 
to the heat capacity of the translational, (rigid) rotational and 17 spec¬ 
troscopically available vibrational degrees of freedom, the remainder 
must be the contribution C of the torsional mode. This calculation of 
C can be carried out at several temperatures. Values of C obtained in 
this way follow the solid part of curve 1, Fig. 9-1, rather closely. The 
one-dimensional Einstein heat-capacity curve which comes nearest to 
fitting these data over the whole temperature range is curve 2 in Fig. 9-1 
(hv/k = 350°K), but the fit is not at all good. We have to conclude, 



Fig. 9-1. Contribution of torsional mode to heat capacity of ethane. Curve 1: 
restricted rotator with Vo = 3100 cab mole -1 . Curve 2: simple harmonic 
oscillator with hv/k = 350°K. Curve 3: free rotator. 


* See Fowler and Guggenheim, pp. 214-215. 
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Fra. 9-2. Potential energy hindering rotation in ethane. 


then, that this degree of freedom is not a simple harmonic (torsional) 
oscillation. 

A second possibility is that the degree of freedom in question can be 
considered to be free rotation, about the C—C bond, of one methyl group 
with respect to the other. However, one free rotational degree of freedom 
at the temperatures in question would give the classical result C/Nk = 
1/2 = constant. This is shown as curve 3 in Fig. 9-1. Clearly this pos¬ 
sibility must also be discarded. 

The correct explanation has turned out to be intermediate between 
these two extremes. If $ is the angle between one methyl group and the 
other, looking along the C—C bond and taking $ = 0 when the two methyl 
groups are “staggered” (not lined up or “eclipsed”), then the potential 
energy (ground-state electronic energy) of the molecule varies periodically 
with 4>. This variation can be represented approximately by 

£F 0 (1 - cos 3*), (9-18) 

as shown in Fig. 9-2. That is, the rotation of one methyl group relative 
to the other is not free, but is hindered by a periodic potential, with barrier 
F 0 . However, when kT » F 0 , the rotation is effectively free and C/Nk 
approaches the value 1/2. On the other hand, when kT <K Fo, the system 
is trapped in one of the three minima in the potential, and the motion is 
simple harmonic (torsional) oscillation. In this limit, C/Nk approaches 
the low-temperature behavior of a one-dimensional Einstein heat-capacity 
curve (Eq. 5-14). 

The following steps are necessary to take care of an arbitrary tem¬ 
perature [e.g., kT = 0(Fo)]: (a) the one-dimensional Schrodinger equa¬ 
tion in the coordinate $ must be solved, using the potential (9-18), to 
give the energy levels; (b) the energy levels are inserted in the parti¬ 
tion function q* = YLi e~*i lkT ; and (c) C/Nk is computed from tem¬ 
perature derivatives of q*, as usual. This work can be carried out nu¬ 
merically. Of course, the function C{T)/Nk will be different for different 
choices of F 0 . Curve 1 in Fig. 9-1 is actually the theoretical curve for 
F 0 = 3100 cal-mole -1 . This curve fits the experimental points over the 
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temperature range of their availability (solid part of curve 1). Curve 1 
approaches curve 3 asymptotically at high temperatures. The theory 
has also been checked and F 0 evaluated by comparing S 0 »i with S apeet , 
over a temperature range, using the energy levels for hindered rotation 
referred to above for the 18th vibrational mode in S 8p eot- 

9-6 Hindered translation on a surface. Here we investigate an adsorp¬ 
tion problem that is very closely related in a formal way to the preceding 
section. In Section 7-1 we discussed the two extreme kinds of monolayer 
adsorption: localized and mobile. Here we consider the transition from 
one kind to the other. Suppose that we have a simple square lattice of 
adsorption sites on the surface of a crystal, with nearest-neighbor distance 
a. The surface area is a = L 2 = Ma 2 , where M is the number of sites. 
We restrict the discussion to the case of a very dilute monolayer: N <5C M, 
where N is the number of adsorbed molecules. The xy-motion of an 
adsorbed molecule is assumed independent of the z-motion. The z-motion 
is taken care of by q„ a one-dimensional harmonic oscillator partition 
function, as in Eqs. (7-3) and (7-18). We represent the potential energy 
function for the xy-motion (see Section 7-1 and Fig. 7-2), approximately, 
by 

V 0 (x, y) = U 00 + \ Vo (l - cos^) + i V 0 (l - cos ^) • (9-19) 

The resemblance to (9-18) for hindered rotation in ethane is obvious. 
At low temperatures, the molecules vibrate about the minima in J7<>(x, y) 
with frequency (Problem 9-4) 



Equation (7-3) is applicable in this limit. At high temperatures, the 
motion is free translation and Eq. (7-17) is appropriate. At intermediate 
temperatures, Eq. (9-19) must be used as it stands. We have then to find 
the quantum-mechanical energy levels and partition function for the po¬ 
tential U 0 (x, y). This problem becomes the same as that in the preceding 
section, after a suitable correlation between the notations. 

The exact treatment of this problem, which is complicated, was omitted 
in Section 9-5 and will be omitted here. We present instead a very ac¬ 
curate (over the entire temperature range) approximate solution which 
was introduced and tested against exact results by Pitzer and Gwinn 
for the hindered rotation problem. The approximation is to write the 
xy-partition function as 

. . 9har osc—quant 
Qxy — Qclass X “ > 

yhar osc—class 


(9-21) 
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where is the classical q xy using the potential U 0 — U 0 o in Eq. 
(9-19), $ho-q is the quantum harmonic oscillator partition function for 
motion about the minima in (9-19), and qho-a is the classical limit of 
tfho—q- It is easy to see that Eq. (9-21) has the right asymptotic properties: 

T 7 * 50, Sho—q 21 ) 0—0 qxy * 9clas»> 

I 1 * 9, 9clas! 1 9ho—o 2xy * 2ho—q* 


The complete partition function for the system is 

Q 1 N 

~m q > 

where 


Q = 


,-U«tlkT 


with q xv given by Eq. (9-21). 
Next we find g c u«>. We have 


+00 L 


where 


Then 


9eUs* = p jJJJ e BlkT dx dy dp x dp y , 

<—oo 0 

»-£+£+i*(*-«--?)• 


2 olsu = 


2irmkT 


h 2 L.J $ 

If we put u = V 0 /2kT and 6 = 2-Kx/a, 


?claas — 


2 irmkT 
h 2 


r ”[s‘4' 


(9-22) 

(9-23) 


(9-24) 


The integral is equal to 2irZo(w), where Z 0 (u) is a modified Bessel function 
of the first kind. Hence 


2olais — 


2irmkT 

h 2 


ae~ 2u I%{u). 


(9-25) 


We can obtain 2ho-« easily from g 0 i„„ using the limit T —► 0 (i.e., u—* oo): 


Qho—c — 0ol*»» (m 


oo) = M 



(9-26) 
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where v x is given by Eq. (9-20) and we have used the property 


lim I 0 (u) = 


(2int)»/2 


The factor M in Eq. (9-26) is to be expected, since there are M sites in 
the area « over which the phase integral in Eq. (9-24) is extended. Finally, 
in view of Eqs. (9-26), (5-6), and (5-7), we must have 


/ e -h, x l2kT \2 
“ M VI - e-A'i/My ‘ 


(9-27) 


We can now substitute Eqs. (9-25) through (9-27) into Eq. (9-21). 
We obtain 

2 icMue- 2u e- 2Ku ll{v) f . 

--(i _ e -Ku) 2 -’ (9-28) 

where 


„ _ ( 2* 2 V'» 

" \ma*VJ 


The adsorption isotherm is linear here (since the adsorbed phase is 
very dilute), but the proportionality constant between N and p (Eq. 7-20) 
is a function of temperature: 

N = (g^^V^V (9-29) 

Other thermodynamic properties can be deduced from Eq. (9-22) in 



50 100 200 

°K 


Fig. 9-3. Theoretical heat capacity curves (contribution from motion in 
plane of surface) for dilute adsorbed phase of argon atoms on 100 plane of KC1. 
Fo is the potential barrier of Fig. 7-2 and Eq. (9-19). 
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the usual way. For example, one finds that the ^-contribution to the 
heat capacity (constant N and Ct) is 


C 

Nk 


—1 -)- 2m 


[• 


Ji(«) „/?(«) 

/ 0 (M) W /g(M)J 


+ 


2K 2 u 2 e~ Ku 
(1 - e-*")* 


• (9-30) 


This equation is illustrated in Fig. 9-3 for argon atoms on a KC1100-plane. 
V 0 is probably about 200 cal-mole -1 . Experimental curves of this type 
have not been observed as yet. In general, the maximum in the heat- 
capacity curve is predicted to occur at about kT = V 0 /5. These curves 
belong to the same family as curve 1 in Fig. 9-1. 


Problems 

9-1. Locate the center of mass of the linear molecule HCN, given that the 
equilibrium intemuclear distances are 1.157 A for CN and 1.059 A for HC. 
Calculate the moment of inertia about the center of mass. Carry out the same 
calculations for DCN. (Page 164.) 

9-2. For a spherical top, the rotational energy levels are 




jU + w 2 

Sir *I A ’ 


3 *= 0 , 1 , 2 , 


with degeneracy wj = (2 j + l) 2 . Ignore molecular symmetry, replace the sum 
in q, by an integral, and verify Eq. (9-7) for the case I a = Ib = Ic- (Page 165.) 

9-3. Derive expressions for n and vP(T) for nonlinear polyatomic molecules. 
(Page 167.) 

9-4. Derive Eq. (9-20) for v x from the potential (9-19). (Page 172.) 

9-5. H 2 O has an OH distance 0.958 A and an HOH bond angle of 104°27'. 
Find the center of mass and calculate I a, Ib, and Ic (see Problem 9-11). 

9-6. CH 4 has a tetrahedral structure with a CH distance 1.094 A. Calculate 
Ia = Ib = Ic- 

9-7. Show that in localized adsorption, in the limit as N/M —*• 0, 


W Nl(M - N)\ N\ ( q> 


Equation (9-22) with (9-26) is an example. The equation of state is <pd. = NkT, 
in view of Problems 3-1 and 3-6. This is verified by Eq. (7-7). 

9-8. Calculate v x from Eq. (9-20), using Fo = 200 cal-mole -1 , a — 4 A 
and m for argon. 

9-9. From Eq. (9-28), derive Eq. (9-30) for C/Nk. 

9-10. In CO 2 , the CO distance is 1.161 A. The vibrational characteristic 
temperatures are 6 ,- = 1890, 3360, 954, and 954°K. Calculate the entropy of 
CO 2 per mole at 1 atm pressure and 194.7°K (see Table 9-2). 
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9-11. The normal vibrational frequencies for H 2 O are 3650, 1590, and 3760 
in cm -1 . The moments of inertia are 1.024, 1.921, and 2.947 X 10 -40 gm*cm 2 . 
Calculate S and p per mole at 1 atm pressure and 25°C (see Table 9-3). 

9-12. Use the data given in the text to calculate Cv for N 2 O at 733°K (see 
Table 9-1). 
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CHAPTER 10 

CHEMICAL EQUILIBRIUM IN IDEAL GAS MIXTURES 

In previous chapters we have shown how the thermodynamic proper¬ 
ties of ideal monatomic, diatomic, and polyatomic gases can be calculated 
by statistical-mechanical methods. An important application of this work 
is to chemical equilibria occurring in ideal gas mixtures. The object is to 
deduce equilibrium constants for such reactions, using spectroscopic in¬ 
formation about the individual molecules. Equilibrium constants obtained 
in this way are often more accurate than those found by direct measure¬ 
ment. Chemical equilibria in imperfect gases will be considered in Chap¬ 
ter 15. 

10-1 General relations. The statistical-mechanical basis of the second 
law of thermodynamics was examined in Section 2-3. In particular, we 
considered the second law in the form S —* maximum (isolated system), 
A —* minimum (N, V, T constant), F —» minimum (AT, p, T constant), 
etc. Since from these statements of the second law, by thermodynamic 
arguments, one can deduce results such as Aju = 0 for phase or chemical 
equilibria, we may take the position that the equilibrium criterion Ap = 0 
has been given a statistical foundation. We have already adopted this 
attitude, for example, in studying the vapor pressure of a Debye crystal 
(Section 5-3) or of an adsorbed phase (Chapter 7). However, because of 
the importance of chemical equilibria, and for variety, we deviate from 
this policy here on two counts: (a) In this section we give the thermody¬ 
namic derivation of Ap = 0 for a homogeneous (i.e., one-phase) chemical 
reaction from the equilibrium criterion F —* minimum (N, p, T constant), 
(b) In the next section we verify the thermodynamic result in (a) for a 
special case, using a purely statistical argument. 

We now turn to the derivation of Ap = 0 for a chemical reaction. Con¬ 
sider a closed one-phase system, at equilibrium, at pressure p, temperature 
T, and containing the numbers of molecules N a, Nb, Nc, Ni, N 2 , .... 
The system may be a gas, liquid, or solid. The equilibrium of the system 
includes equilibrium with respect to a chemical reaction, say 

vaA + vbB ?=* vcC, ( 10 - 1 ) 

where v A , vb, and vc are small integers. The species 1, 2, ... are present 
but do not participate in the reaction. The Gibbs free energy of this 
multicomponent system is 

F = Napa + NbPb + NcPc + N iMi + N 2 P 2 + • • • • 

177 
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Now consider the infinitesimal process of converting, say, Va d£ molecules 
of A and vb d£ molecules of B into vc d£ molecules of C. Since the process 
is an infinitesimal one, none of the intensive properties of the system will 
change appreciably and hence all the /it’s are constant during the process. 
The change in F in the process is therefore 

dF = ha dNA + Ms dN b + Me dNc 

= (—I 'AHA — vbhb + Vchc) dk = Am d{. 

No terms in Hit Pi, etc., appear here, since N i = constant, etc. Since the 
process occurs in a closed system (molecules of A, B, and C are intercon- 
verted but no molecules go in or out of the system) at constant p and T, 
and the system is at equilibrium, we must have, according to the second 
law, dF — 0. We therefore conclude, since df is arbitrary, that Aj* = 0 or 

vaha + vbhb = vchc. (10-2) 


The chemical potentials here must be assigned the values they have after 
chemical equilibrium has been achieved. This result is general; it is not 
restricted to an ideal gas mixture, or even to gaseous reactions. 

Now let us apply Eq. (10-2) to the special case in which we are inter¬ 
ested in this chapter—a chemical reaction occurring in an ideal gas mix¬ 
ture. First we note that the expression for the chemical potential of each 
species i, in terms of AT,-, V, and T, is the same as if only that species existed 
in the system (this is true, of course, only for an ideal gas mixture). To 
see this, consider, say, a binary system. Then from Eq. (3-15) [see also 
Eq. (4-27)], 


Q(Ni,N a ,V,D = 


qi(V, T) N ' 
Nil 


qi(V, T) N * 
N 2 l ’ 


(10-3) 


where qi and 92 may refer to monatomic, diatomic, or polyatomic mole¬ 
cules. We find easily from 


that 



Mi. _ 
kT ~ 


In 


qi(V, T) 

Ni 


(10-4) 


just as for a one-component gas. 

If the reaction (10-1) occurs between molecules A, B, and C in an ideal 
gas mixture, the equilibrium condition is (10-2), where each chemical 
potential is given by an equation of the form (10-4). Substituting Eq. 
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(10-4) in (10-2), we find 

N? q’c 

qpqp 


(10-5) 


The N’s in this equation refer, of course, to the numbers of molecules 
present after the chemical equilibrium has been established. Now, each 
q is equal to V multiplied by a function of temperature only [see Problems 
3-1 and 3-6 and Eqs. (4-10), (8-4) and (9-1)], so that q/V is a function 
of temperature only. Therefore, the quotient of q’s in Eq. (10-5) is in 
general a function of both V and T, but we have 


Pc C = (Qc/V) ,c = Km 
pPpP {u/V)'*{<ib/vY* 


( 10 - 6 ) 


where pc = Nc/V, etc., and K(T) is a function of temperature only, 
called the equilibrium constant. 

As the partial pressures are p% = pjkT, we also have 

Pc° = (g c kT/V)’ c 
P1 A PP (qAkT/V)'Hq B kT/V)’* 

= (kT)’ c - rA -’’ B K(T) = K P (T), (10-7) 

which is, in general, a different equilibrium constant, often used. 

In thermodynamics, one substitutes 


m = p,i(T) + kT In pi 


for each component in Eq. (10-2) and obtains 

_j£L = e -*r 0 l" = K P (T), (10-8) 

Pl A Pp 

where 

A F°(T) m vcuUD - VaA(T) - v B p° B (T). (10-9) 

This is called the “standard free energy change” for the reaction. Equa¬ 
tions (10-7) and (10-8) are, of course, equivalent and are interrelated by 


kT ~ 



( 10 - 10 ) 


for each participant in the reaction [see, for example, Eqs. (4-26) and 
(8-40)]. 
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In Eqs. (10-6) and (10-7), or indeed in any application of Aft = 0, 
it is always understood that the zeros of energy chosen for the different 
molecular species (or states, in a phase equilibrium) involved must be 
self-consistent. For example, one can choose as zero for each molecule the 
separated atoms of which the molecule is composed, at rest. Since the 
same atoms occur on both sides of the chemical equation (10-1), this choice 
necessarily leads to the same zero for reactants and products. A number 
of specific examples will be found in Section 10-4. 

10-2 Statistical derivation in a special case. In this section we derive 
Eq. (10-5) for a simple reaction, using a statistical argument which by¬ 
passes the thermodynamic equation (10-2) (but does use the second law of 
thermodynamics). We could use the more complicated reaction (10-1), 
but there would be no particular advantage in this since we merely want 
to illustrate the equivalence of the two approaches. 

Consider an ideal gas mixture made up of Na molecules of type A and 
Nb molecules of type B in a closed container with fixed V and T. We 
assume that in the absence of a catalyst, no chemical reaction takes place 
between A and B molecules. The canonical ensemble partition function 
for this binary system is naj/b 

Wa, N b , V, T) = , (10-11) 

and the Helmholtz free energy is A — —kT In Q. Suppose a catalyst is 
added, making possible the chemical reaction A «=* 2 B, which will take 
place spontaneously until equilibrium is established with respect to the 
reaction. Since the system is closed and has constant V and T, we know 
from the second law of thermodynamics that as the reaction proceeds the 
Helmholtz free energy will decrease, and that at equilibrium this free 
energy will have its minimum possible value consistent with V, T, and 
with the fixed amount of material in the container. (We can specify this 
amount by stating that each A molecule contains two B units, and the 
total number of B units is 2 Na + Nb = N = constant.) Translating 
this into statistical-mechanical language, we can say that the equilibrium 
point of the reaction for given V and T will correspond to the maximum 
possible value of Q (since A = —kT In Q) consistent with the restraint 
2 Na + Nb = N = constant. 

We need, then, to maximize Q in Eq. (10-11) subject to 2Na + Nb = N. 
We could use an undetermined multiplier, but this is unnecessary here 
because the restraint can be used instead to eliminate, say, Nb from 
Eq. (10-11) (see Appendix III). We have 

In Q = Na In Qa + (N — 2 Na) In — Na In Na 
- (N - 2 Na) In (N - 2 N A ) + N — Na, 
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and hence 



. q A (N - 2NS) 


where Na is the value of Na making Q a maximum—that is, the value of 
Na at equilibrium. Therefore, 


W£_q | 
N1 ~ U 


(10-12) 


which is the same as Eq. (10-5) for this special case. 


10-3 fluctuations in a simple chemical equilibrium. Here we consider 
a situation very similar to that in the preceding section: we have a binary 
gas mixture with numbers of molecules Na and Nb, in a volume V at 
temperature T. The reaction A ?=* B is then made possible by introduc¬ 
tion of a catalyst. The system proceeds to equilibrium with V, T, and 
Na + Nb = N held constant. 

We adopt a slightly more general point of view here than in the pre¬ 
ceding section. That is, we use the partition function Q(N, V, T) for the 
system described above, including all possible states of the system for 
given N, V, T; that is, including all possible values of Na and Nb con¬ 
sistent with any assigned value of N. Thermodynamically, this is a one- 
component system (N). Then 

M.v.n- £ jfrSh <>o-w> 

Na.Nb 

(Na+Nb-N) 


— (gA + Qb) N 

~ Nl 


(10-14) 


The treatment in the preceding section corresponds to the use of only the 
maximum term in the sum (10-13). With the retention of the full sum, we 
can easily investigate the extent to which we can expect to find fluctua¬ 
tions about the equilibrium concentrations predicted by equations such 
as (10-5) and (10-12). 

The average value of N a is clearly 



E 

If A. If B 
UfA+NB-N) 


NAfft A & B 

Na'.NbI 


(10-15) 


If Q in Eq. (10-13) is regarded in a purely formal way as a function of N, 
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qA, and qB, we note from Eq. (10-15) that 

ir, ■ 

\ oqA /q B ,N 

Then, from Eqs. (10-14) and (10-16), 

tj a 

?A + ?B 

and therefore, since Fa + F* = N, 

Fg _ qB 
Fa qA ’ 

in agreement with Eq. (10-5). Now if we differentiate 

W A N a \N b \ 

Na.Nb 

Wa+vb-N) 

with respect to qA, we obtain 


(10-16) 


(10-17) 


(10-18) 


(10-19) 


Finally, from Eqs. (10-17) and (10-19), we have 

N*a ~ (Fa ) 2 = = N% - (F b ) 2 . (10-20) 

Fluctuations about the equilibrium composition are therefore very small, 
as we might expect: <tna/Fa = 0(2V"~ 1/2 ), the usual result (see Section 
2-1). 


10-4 Examples of chemical equilibria. In this section we discuss five 
particular illustrations of the above equations. 

(a) Isomeric equilibrium. This is the case A & B already considered 
as a fluctuation problem in the preceding section. We wish to examine the 
equilibrium relation (10-18) here. As an example, suppose that some of the 
energy levels of A are lower than those of B (the translational levels 
would be the same in the two cases, because A and B have the same mass 
—so these are not involved), but that the levels of B are closer together, 
as shown schematically in Fig. 10-1. This would arise, for example, if 
(a) the chemical bonds in A are more stable than in B (i.e., the minimum 
in the ground electronic energy surface is lower for A), (b) the vibrational 
force constants are in general larger in A than in B (i.e., the chemical 
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t 


A B A+B 

Fig. 10-1. Energy states for two isomers, A and B (schematic). 


bonds are “stiffer” in A), and (c) the moments of inertia are larger in 
B than in A (i.e., the atoms are more spread out spatially in B). 

We can take the point of view that a given molecule has accessible to 
it the full set of energy states indicated by A + B in Fig. 10-1, with 
partition function q, hut that these states can be classified into two sub¬ 
groups called A and B, with partition functions q A and qa- 

q A = 

i i 

4 = E e ~* lkT + E e ~* ,kT - + qa. 


There is a single Boltzmann distribution of molecules among all the 
levels A + B. The fraction of molecules in any level, say e, is e~* ,kT /q. 
Therefore the fraction of molecules in all levels belonging to subgroup A is 


N a 

N a +Nb 



9a , 

9 ’ 


in agreement with Eq. (10-17). 
The expression 


Nb _ qB 
N a q A 


E^ /tr 

Zte-'ilkT 


shows how the density and energy of the two subgroups of quantum states 
determine the equilibrium composition Nb/N a . The first few terms in 
the sum q A are the largest of all, because the energies in the exponents are 
lowest. But the number of terms in qa is larger, so that > q A is still 
possible. In general, a substance is favored in an equilibrium (i.e., its q 
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is larger) if it has low-lying and dense energy levels. The first is primarily 
an energy effect and the second an entropy effect. In the example in Fig. 
10-1, A would predominate at low temperatures, because the occupation 
of low levels is emphasized by the Boltzmann factor at low temperatures. 
But B would be favored at high temperatures, because the Boltzmann 
distribution becomes more uniform over all levels, regardless of energy, 
at high temperatures. Then the number or density of levels counts most. 
This is quite general: in an equilibrium competition of this sort between, 
say, reactants with low E and products with high S, the reactants will 
win out at low temperatures and the products at high temperatures. At 
intermediate temperatures, there will be a compromise between low 
energy (A) and high entropy (B) to give the system (as always) the 
minimum possible free energy E — TS (N, V, T constant). 

(b) Ionization of hydrogen atoms. The reaction we consider here is 

H *± H+ + e~. (10-21) 

The concentrations of all species and the temperature are such that classi¬ 
cal statistics and ideal gas behavior can be assumed. The zero of energy is 
chosen as separated proton and electron at rest. The ground state (the 
only state we need consider) of the hydrogen atom relative to this zero 
then has an energy « el = —13.53 ev. Because of electron spin, <o n == 2 
for the hydrogen atom. There is the same degeneracy co = 2 for the elec¬ 
tron. The proton spin degeneracy can be ignored as usual (for nuclei), 
since it appears in both H and H + and cancels in the equilibrium constant. 
Then we have: 

( 2irmH+kT\ 3li Tr 

?H+ = 9/( H+) = ^-p- J V, 

0 ( 2irm,-kT\ 312 Tr 
«•- = 3««-> = 2 ^—p- J V, 

„__ _ n (2-Km-ELkf\ 312 „_-UMkT 

9 h — g«H)3«(H) = 2 1—p— 1 Ve 

Hence, from Eq. (10-6), the equilibrium constant for the reaction (10-21) 
is 

K(T) == pH * P< ~ = (10-22) 

where we have neglected the difference in mass between H + and H. If 
the ionization is small, p H is practically equal to Ph, the total concentra¬ 
tion of H atoms placed in the volume V. Also, necessarily pn+ = p e — 
In this case the fraction a of H atoms which have ionized at equilibrium is 
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„ = £M+ _ [ 1 (2mn e -kT\ 312 ^ikr] 112 

PS h> ) 6 J 


(10-23) 


As a numerical example, let us take T = 10,000°K and p& = 0.01 mole- 
liter -1 . Then we find from Eq. (10-23), a = 7.8 X 10 -3 (Problem 10-1). 
Equation (10-22) has been verified experimentally for Cs <=* Cs + + e~. 

In the equilibrium (10-21), the hydrogen atoms are favored by a much 
lower energy, but the dissociated state is favored by the entropy. The 
numerical calculation above shows that very high temperatures are needed 
for the entropy to overcome the energy in this particular competition. 

(c) Dissociation of hydrogen molecules. We assume that the tempera¬ 
ture is such that the rotation in H 2 is classical and that the hydrogen 
atoms themselves ionize [as in (b)] to a negligible extent. The ground- 
state degeneracy of H 2 is unity. The zero of energy is chosen as separated 
atoms at rest. On this basis, the ground state energy of H 2 is (Section 8-1) 
e«i = —D, — —4.722 ev. The partitions functions are 


9h 




{4:TrmukT \ 312 xr T e e ’ l2T 

$h 3 - q^rQvq. - y h2 J V ■ 2 0 r • i _ g-e./r ' * 

Then, for the reaction 


Hjf±2H, 


(10-24) 


(10-25) 


we have, from Eq. (10-6), 


K(T) = ^3. = 

ph, (qu,/V) 


If the fraction a of dissociated molecules is small, p H2 = pg 2 (total con¬ 
centration of H 2 placed in F), and 


2 _ Ph f 1 (iwmnkT V' 2 e r 1 - e~ e ’ /T 

a P° Ki jP^V V ) T ■ «-*./» r 


e 


,«el !kT\ 


11/2 


(10-26) 


Thus, aside from pjj 2 and T, a depends on the mass m H , the equilibrium 
internuclear separation r„ the vibrational force constant/, and the ground- 
state electronic energy e e i- If we use the values of 8, and 0, for H 2 in 
Table 8-1, we find at 1000°K and Pn a = 0.01 mole-liter -1 , a = 4.6 X 
10 -9 (Problem 10-2). Again, as in (b) above, hydrogen molecules are 
favored in the equilibrium by the energy, but hydrogen atoms are favored 
by the entropy. This will in fact always be the case in a dissociative 
equilibrium. 
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(d) Isotopic exchange reaction. Consider the reaction 

H 2 + D 2 r± 2 HD (10-27) 


at temperatures such that rotation is classical and vibration is unexcited. 
The Bom-Oppenheimer approximation has interesting consequences in a 
reaction of this type. These follow from the fact that in this approxima¬ 
tion the ground-state electronic energy u e (r) must be the same function 
for all three species: H 2 , D 2 , and HD. Therefore r», /, and e,! [see (c) 
above] must all be the same for the three species. Hence, in the quotient 


K(T)== -Pk> -_db_, (10-28) 

PHjPDj PH a PD a SH,«Da 


the translational factors cancel except for the masses; the rotational factors 
cancel except for the reduced masses (7 = prf) and the symmetry num¬ 
bers; and the electronic factors cancel completely. Only the zero-point 
vibrational terms e -e » /2r [see Eq. (10-24)] are left in the (unexcited) 
vibrational partition functions, and in these the frequencies are the same 
except for the proportionality to p _I/2 (Eq. 8-18). Specifically, the sep¬ 
arate quotients in Eq. (10-28) are easily seen to be: 


Translation: 


(f»H + tod ) 3 j 
(4tohtod) 8 /2 


Rotation: 


16 mHWp 
(% + TO p) 2 


Vibration: 



hv HD L _ TOp /2 + toe/ 2 1| 

kT L 2i/2(m H + TO D )i/2jj' 


Putting these together we find 


K(T) = 2 


(toh + tod) 
TOI/2to1/2 

U D 


hv HD r, 

, mb 2 + mU 2 11 

kT L 

2i/2(m H + tod) I/*.]/ 


(10-29) 


Using phd = 3770 cm -1 , we calculate K = 3.46, 3.67, 3.77, and 3.81 
for T — 383, 543, 670, and 741°K. These are in good agreement with the 
respective experimental values 3.50, 3.85, 3.8, and 3.70. The theory must, 
in fact, be considered more accurate than the experimental work. 

Isotope effects are not usually this large, of course. Note that if we put 
toh = tod in Eq. (10-29), K — 4 (owing to the symmetry numbers). 
For the chlorine isotopes 35 and 37,1C = 4(1 — $), with 5 = O(10 -4 ). 
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(e) “Water-gas” reaction. As a final example, we calculate the equilib¬ 
rium constant K for the so-called “water-gas” reaction, 


C0 2 + H 2 ?=* CO + H 2 0. (10-30) 


We restrict ourselves to temperatures high enough so that rotation is 
classical in H 2 . Then 

_ PCOPHaO _ PCOPHaO _ 9COgHaO 
PcOjPHj PcojPHj 9cOi9h» 


where 


_ 9co9h 3 o e -\* tl ikT 
gfeoagiia 


(10-31) 


Ae«i = e,i(CO) + <«i(H 2 0) — €,x(C 0 2 ) — e«i(H 2 ), (10-32) 


and g' means the partition function q with the electronic part q„ omitted: 
g' = q t qrqv For all these molecules, «,j = 1. The zero of energy is the 
separated atoms (C, O, H) at rest. Actually, we do not need to know 
each e,i separately relative to this zero, but only the combination Ae«i. 
For this reaction (from combined thermal and spectroscopic information), 
A«,x = 7000 cal-mole -1 . Incidentally, the energy change for the re¬ 
action at 0°K involves the zero-point vibrational energies as well, and is 
given by 

AF(0) = Ae.x + (A/2)A^X) 

= 7000 + 2640 = 9640 cal-mole -1 , 


where the vibrational frequencies can be found in Table 8-1 and Problems 
9-10 and 9-11. The rotational data are included in the same sources 
(ico, = 71.9 X 10 -40 gm-cm 2 ). With this information (A«»x, p’s, I’ s), 
K can be calculated by use of expressions for q t , g„ and g„ which we shall 
not write down again. One finds K = 0.45 at 900°K and 1.41 at 1200°K 
(actually using spectroscopic energy levels instead of the slightly approxi¬ 
mate equations in Chapters 8 and 9), compared with the respective ex¬ 
perimental values 0.46 and 1.37. 
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Problems 

10-1. Calculate the fraction of hydrogen atoms ionised at 10,000°K when 
Pg = 0.01 mole-liter -1 . (Page 185.) 

10-2. Calculate the fraction of hydrogen molecules dissociated into atoms 
when Ph, — 0.01 mole-liter -1 and (a) T *■ 1000°K, (b) T = 5000°K. (Page 
185.) 

10-3. Repeat the argument of Section 10-2, but maximize the partition func¬ 
tion ACNa, Nb, p, T) [see Eq. (4-35)] at constant N, p, T instead of Q at con¬ 
stant N, V, T. 

10-4. In the example of Section 10-3, show that 

PI-W. - 

Why the negative sign? 

10-6. Use the data in Table 8-1 to calculate K at 700°K for the reaction 
2HI pi H 2 + I 2 . 

10-6. Calculate K at 1000°K for the reaction I 2 <=* 21 (see Section 4-4 and 
Table 8-1). 

10-7. Calculate 0, for H 2 , given that v = 3770 cm -1 for HD. 

10-8. Calculate 0, for H 2 , given that, for H 2 , £>, — 4.722 ev and Do “ 4.454 
ev. 

10-9. Calculate K for the “water-gas” reaction at 1200°K using data in the 
text. 


Supplementary Reading 

Fowler and Guggenheim, Chapter 5. 

Mayer and Mayer, Chapter 9. 

Rushbrooke, Chapter 11. 
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CHAPTER 11 

THE RATE OF CHEMICAL REACTIONS IN IDEAL 
GAS MIXTURES 

Strictly speaking, the subject indicated in the chapter title is outside 
the scope of this book, which is devoted to equilibrium statistical me¬ 
chanics. However, Eyring’s approximate absolute reaction rate theory 
has a quasi-equilibrium foundation: it is based on an application of the 
chemical equilibrium theory of the preceding chapter. For this reason, 
and because of its importance, it seems appropriate to include an account 
of the Eyring theory in the present work. But the treatment we give will 
be very brief, and we shall not consider any detailed special cases. The 
reader should consult the book by Glasstone, Laidler, and Eyring (see 
the Supplementary Reading list) for further details. 

More exact approaches to this problem will not be discussed here, since 
they cannot be put in quasi-equilibrium form. 

An example of a nonchemical application of Eyring's theory is presented 
in Section 11-3: the surface diffusion of a dilute, localized monolayer. 

11-1 Potential surfaces. There are two distinct stages in the Eyring 
theory. The first is the purely quantum-mechanical one of calculating the 
ground-state electronic energy surface (potential surface, for short) for 
the reaction, and the second is the statistical-mechanical calculation of 
the reaction rate. This division is the same as that which we have en¬ 
countered in calculating the thermodynamic functions of, say, an ideal 
polyatomic gas (Chapter 9). In this latter problem, we first have to find 
the potential surface of the molecule by quantum mechanics (or obtain 
equivalent information empirically from spectroscopy). This surface 
(see Section 9-1) determines the equilibrium structure of the molecule, 
the moments of inertia, the vibrational force constants and normal co¬ 
ordinates, and the depth of the potential well in the surface relative, say, 
to separated atoms as zero. With this information, we can then turn to the 
statistical-mechanical problem of deducing the thermodynamic functions. 

We discuss the potential-surface part of the rate problem in this section, 
and the statistical-mechanical part in the next section. 

For ease of visualization, let us consider a hypothetical one-dimensional 
reaction 

A + BC —» AB + C. (11-1) 

Three atoms and (in one dimension) three nuclear coordinates are in- 
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Fig. 11-1. Potential surface (u.) in the form of a contour diagram for a 
hypothetical one-dimensional reaction A + BC —* AB + C. The numbers on 
the contours are values of u, in arbitrary units. 


volved. One coordinate is concerned with the center of mass and is there¬ 
fore uninteresting for the above process. The other two coordinates de¬ 
termine the configuration of the three nuclei relative to each other (see 
Appendix V). For example, we might choose for these two coordinates the 
intemuclear distances r AB and tbc- For given values of t A b and r B c 
the ground-state electronic energy u e (r AB , r BC ) is calculated. From a 
large number of such values of r AB and r B c, one can construct a potential 
surface in the form of a contour diagram, which in a typical case might 
appear as in Fig. 11-1. The valley at the upper left corresponds to an A 
atom and a diatomic BC molecule. (The curvature of the surface at the 
bottom of and perpendicular to the valley determines the vibrational 
frequency in BC; the depth of the valley is a measure of the energy of the 
bond BC.) The valley at the lower right corresponds to the state AB + C. 
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Reaction coordinate 


AB + C 



(b) 



Fig. 11-2. (a) Potential energy (u,) along the reaction path in Fig. 11-1. 
Note the potential barrier, (b) Corresponding curve from Fig. 11-3. Note the 
minimum in u, (stable molecule, ABC), (c) Corresponding curve from Fig. 11-4. 

The high plateau is A + B + C. When reaction (11-1) occurs, the lowest 
possible path from reactants (A + BC) to products (AB + C) is the 
dashed line in Fig. 11-1. The highest point on this path is marked X in the 
figure. This is the “activated state,” and the triatomic system A, B, C 
at this point is referred to as an “activated complex,” denoted by (ABC)*. 
If one plots the potential energy u e along the dashed path of Fig. 11-1 
as a function of the distance along the path (called the “reaction co¬ 
ordinate”), one obtains a curve as in Fig. ll-2(a). The height Awf of the 
potential energy barrier which must be overcome is called the “activation 
energy." 
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r BC 

Fio. 11-3. Potential surface (u«) in the form of a contour diagram in one¬ 
dimensional case where stable molecule ABC is formed. 


For the reverse reaction, 

AB + C —► A + BC, (11-2) 

the reaction path in Fig. 11-1 must be reversed in direction. In this 
example the activated state and complex (X in Fig. 11-1) are the same 
for forward (11-1) and reverse (11-2) reactions. Let 

Aw, = u e (AB + C) - u e (A + BO- (11-3) 

This is Aw, for the reaction as written in (11-1) and has the same meaning 
as Ae,i in Eq. (10-32). In Fig. 11-1 it is determined by the difference in 
levels of the two valleys. In Figs. 11-1 and ll-2(a), Aw, is negative. If 
Awf is the activation energy for the forward reaction, Awf — Aw, is the 
activation energy for the reverse reaction. 
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Fig. 11-4. Potential surface (u,) in which a basin occurs between two 
potential barriers. 

In a real (three-dimensional) triatomic reaction we would need a con¬ 
tour diagram j n three-dimensional {tab, tbc, *ac) instead of two-di¬ 
mensional space (or a surface in four-dimensional space). But the general 
concepts introduced above for a hypothetical one-dimensional reaction 
remain the same. Incidentally, it should be noticed that linear configura¬ 
tions of A, B, C are included in the potential surface (in fact, activated 
complexes in triatomic reactions are usually linear), and we recall that 
linear molecules have four, not three, vibrational coordinates. However, 
this is not contradictory to the statement that u« is a function of the three 
variables tab, tbc, and tac only, for two of the normal modes in a linear 
molecule are degenerate (i.e., of the same frequency). For example, the 
positions of the atoms in the two degenerate modes of C0 2 [shown pre¬ 
ceding Eq. (9-3)] are expressible by the same sets of values of tab, tbc, 
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and r A c (the triangle ABC is merely rotated 90° in going from one mode 
to the other). 

In contrast to Fig. 11-1, we show in Fig. 11-3 what a typical (one¬ 
dimensional again) potential surface might look like in the event that A, 
B, and C formed a stable molecule ABC. The potential well in Fig. 11-3 
is characteristic of a stable molecule. The equilibrium molecular geometry 
of ABC is determined by the location of the bottom of the well, and the 
vibrational motion of the molecule is determined by the shape of the well 
in the neighborhood of the minimum [see also Section 9-1 and Eq. (V—1)]. 
A plot of w« as a function of distance along the lowest possible path from 
the upper left valley to the lower right valley in Fig. 11-3 would appear 
as in Fig. ll-2(b). This should be contrasted with Fig. ll-2(a). 

Another type of potential surface is shown in Fig. 11-4. A basin is 
situated between two potential barriers, X and X'. The potential energy 
as a function of the reaction coordinate is shown in Fig. ll-2(c). A basin 
may possibly occur,* for example, in the reaction 

H + H 2 —► H 2 + H. 

11-2 Absolute rate theory. As a concrete example, let us return to the 
reaction (11-1) and discuss it now as a real three-dimensional reaction. 
We suppose that the potential surface is of the general type shown in 
Fig. 11-1; that is, the potential “profile” has one potential barrier, as in 
Fig. ll-2(a). 

The fundamental assumption of the Eyring theory is that, during the 
course of the reaction (11-1), molecular configurations corresponding to 
the upper left valley in Fig. 11-1 (i.e., reactant molecules) are in thermo¬ 
dynamic equilibrium with molecular configurations corresponding to the 
neighborhood of the activated state X in Fig. 11-1 (i.e., activated com¬ 
plexes). This is an assumption which cannot be rigorously correct, but 
which is probably rather accurate in many cases. This assumption of 
equilibrium between reactant molecules and activated complexes makes it 
possible for us to use the methods of the preceding chapter on chemical 
equilibria to deduce the concentration (defined below) of activated com¬ 
plexes. From this knowledge, as we shall see, we can then calculate the 
number of reactants passing over the barrier, from upper left to lower 
right, per unit time and per unit volume of the system. This is the desired 
reaction rate. 

An activated complex is very much like an ordinary stable molecule. 
It has a definite mass {m A + ms + me in this example) and a definite 

*See Glasstone, Laidler, and Eyring (Supplementary Reading list), 
p. 108, and R. E. Weston, Jr., J. Chem. Phys. 31, 892 (1959). 
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nuclear configuration (corresponding to the position of the X in Fig. 11-1 
at the top of the potential barrier). This configuration determines the 
moments of inertia and the symmetry number. Thus we can immediately 
write (assuming the potential surface is available) the translational and 
rotational partition functions for an activated complex, just as we did in 
Chapter 9 for a stable molecule. 

Furthermore, we can carry out a normal-coordinate analysis for the 
vibrational frequencies, based on the shape of the potential surface in the 
neighborhood of the activated state. If the activated complex is linear, 
there will be 3n — 5 normal vibrational modes, otherwise there will be 
3n — 6. The activated state is located at a saddle point in the potential 
surface. That is, although the activated state is a maximum point along 
the reaction coordinate, it is a minimum in other directions (e.g., in Fig. 
11-1, in the direction perpendicular to the reaction coordinate). This 
feature will appear automatically in the normal-coordinate analysis when 
the potential energy u e is expressed in terms of the normal coordinates 
{< [see Eq. (V-13)]. Necessarily (i.e., by definition of a normal coordinate), 
u, will be a sum of squared terms in the {,-, and the coefficients will be posi¬ 
tive as usual (Eq. V-13) except for one coordinate, call it {, which will have 
a negative coefficient. This particular normal coordinate is the rigorous 
equivalent of what we have hitherto been loosely calling the reaction co¬ 
ordinate. The coefficient of { 2 in u, is negative because the potential sur¬ 
face Jails off on both sides of the activated state along this direction (and 
this direction only). 

From the normal-coordinate analysis we thus obtain 3n — 6 (linear) 
or 3n — 7 (nonlinear) ordinary vibrational frequencies v t . The frequency 
associated with {is imaginary because the force constant (twice the coeffi¬ 
cient of { 2 ) is negative. We can therefore construct, in the usual way, from 
the V{, a vibrational partition function for the activated complex—except 
we omit the factor belonging to the {-motion. Thus of the 3n nuclear 
degrees of freedom of an activated complex, 3n — 1 can be handled just 
as with stable molecules. Only the reaction normal coordinate { requires 
special treatment. 

Let q* represent the vibrational partition function of the activated com¬ 
plex, omitting the {-factor. Also, let q* = q*qr*q*q* be the complete 
partition function of the activated complex, just as for any polyatomic 
molecule—except, again, omitting the { degree of freedom. 

We wish next to calculate the number of activated complexes per unit 
volume of the system which are in an infinitesimal range d{ of the reaction 
coordinate at the activated state. We shall call this number p' d{, so that 
p' is the number of activated complexes per unit volume and also per unit 
length along the reaction coordinate { at the activated state. We want 
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the number p' d£ to include all activated complexes in d£ irrespective of 
the value of p*, the momentum conjugate to £. For the element of phase 
space d£ dpt in the coordinate £, the (classical) partition function is 

I d(dp h (11-4) 


where m* is defined by pi = m*£. It should be noted, incidentally, that 
there is always some arbitrariness in defining normal coordinates [see the 
constants O in Eqs. (V-12) and (V-14)], and so there is arbitrariness in 
m* and £. But combinations of these quantities with physical signifi¬ 
cance (e.g., Pj/2m* or d(dpt above) are not arbitrary. Integration of 
(11-4) over pi gives the £ partition function for an activated complex in 
djj. The complete partition function for such an activated complex is then 



With the assumption, already mentioned, of equilibrium between re¬ 
actants (say A and BC, for concreteness) and activated complexes, 


A + BC*± {ABC)*, 


( 11 - 6 ) 


we have, as in Eqs. (10-6) and (10-31), 

p'd£ (q*/V)(2irm*kT/h 2 ) 112 d£ 

PaPbc (Qa/V){Qbc/V) 

_ (‘ <t*'/V)(frrm*kT/h 2 ) m di _-&„+ikT 
Wa/V)% c /V) 


where <i means, as before, that q, is omitted from q. The quantity A uf is 
defined for the process (11-6) by analogy with (11-2) and (11-3). If the 
ground electronic states are degenerate, a factor wf 1 /w el (il)« 4 i(B(7) must 
be included in Eq. (11-7) (w,* being the degeneracy of the activated 
complex). 

Equation (11-7) provides us with an explicit equation for p', the num¬ 
ber of activated complexes per unit volume of the system and per unit 
length along £ at the activated state. Our next task is to calculate the 
number of activated complexes per unit volume which cross the potential 
barrier X per unit time in the direction of reaction (left to right in Fig. 
11-1). Assuming all of these complexes become products, this is the rate 
of the reaction. Consider those activated complexes with values of £ be¬ 
tween £ and £ + d£. The fraction of all activated complexes which are 
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in this class is, from (11-4), 

e -m*?l2kT ^ 




/ +« 


n * i’/atr 


dk 


( m* V ' 2 

~ \2jt kTj 6 




dt 


Suppose £increases as the activated state is approached from the reactant 
(left) side in Fig. 11-1. Then an activated complex with | > 0 is pro¬ 
ceeding along the reaction coordinate in the direction required for the 
reaction to take place. We note that activated complexes with a given 
value of £ > 0 will cross the potential barrier in unit time if they start at 
a distance from X not greater than a length of magnitude £. The number of 
activated complexes per unit volume and per unit length along £ having 
values of £ in the interval d£ is p'/(£) dt The number of activated com¬ 
plexes per unit volume in the length £ along £ having £ in the interval d£ 
is then £p'/(£) d£. This is the number of complexes per unit volume with 
£ in d£ which cross the barrier per unit time. To get the total number 
of complexes per unit volume crossing the barrier per unit time, we have 
to integrate £ over the range 0 < £ < + oo: 

✓ jf -✓(£»)"'(g) 

— k PaPbc, (11-8) 


which is the defining equation for k. The quantity kpAPsc is the rate of 
the reaction, and k(T) is called, conventionally, the rate constant. If p' 
is obtained from Eq. (11-7), we find 


, kT (q*'/V)e- Auf,kT 
* ~ h (ct A /V){q' BC /V) ‘ 


(11-9) 


If only a fraction k (called the transmission coefficient) of complexes pass¬ 
ing the potential barrier in the right direction actually proceed to products, 
then k must be inserted as a factor on the right-hand side of Eq. (11-9). 
This situation arises, for example, in cases such as Fig. 11-4, where the 
system passes over a barrier ( X ) but then finds itself in a basin. The sys¬ 
tem may leave the basin (via X r ) to form products or return (via X) to 
reactants. In the reaction H + H 2 —*• H 2 + H, the basin is symmetrical 
(if it exists), and it is usually assumed that k = 1/2. 

Equation (11-9) furnishes a straightforward statistical-mechanical 
recipe for calculating the rate constant k. The potential surface must be 
available, however, and this is a very serious practical obstacle. Because 
of this, it is difficult to test the theory in a really satisfactory way. Of 
course, one does not expect exact agreement between theory and experi- 
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ment, as the argument used to deduce Eq. (11-9) is not rigorous—the as¬ 
sumption of equilibrium between reactants and activated complexes being 
especially questionable. Also, if a transmission coefficient must be used, 
this introduces a somewhat nebulous feature into the theory since k is 
in general difficult to evaluate. 

For the reverse reaction (11-2), the rate constant is 


kT (q* '/V)e~ lAu * 

” h tf AB /V)%/V) 

The equilibrium constant K for the reaction (11-1) is then 

JC-.lL— Wab/V)Wc/V) ,-*u.lkT 
* ” W A /V)(q’ BC /V) 

in agreement with Eqs. (10—6) and (10-31). 


( 11 - 10 ) 


(11-11) 


11-3 A nonchemical application of the Eyring theory. The fundamental 
ideas in Eyring’s theory of the rate of chemical reactions can be and have 
been applied to many physical rate processes as well. Merely to illustrate 
the possibilities, we consider here a particularly straightforward example, 
namely, the rate at which monatomic molecules adsorbed at localized 
sites on a surface jump from one site to another. This rate is of course 
closely related to the coefficient of surface diffusion. The model we con¬ 
sider is essentially that already discussed in Sections 7-1 and 9-6. We 
have a lattice of equivalent surface sites for adsorption, but we need not 
specify the lattice type. The number of adsorbed molecules is small, so 
that each one behaves independently. The potential in which a molecule 
moves is U 0 (x, y) [see, for example, Eq. (9-19)]. The potential wells in 
U 0 (x, y) are the sites for adsorption. The partition function for an ad¬ 
sorbed molecule at a site is given by Eq. (7-3). 

To move from a given site to a nearest-neighbor site, a molecule must 
pass over a potential barrier of height Vo • The top of the barrier is the 
activated state for this process: 

A (site) —» A* (top of barrier) —» A (neighboring site). 

Let £ and y be the normal coordinates at the activated state, which is a 
saddle point in the surface U 0 . We take £ as the “reaction coordinate.” 
That is, the coefficient of £ 2 in the expansion of Uo in powers of £ and y 
about the activated state is negative, while the coefficient of y 2 is positive. 
Thus a molecule at the top of a potential barrier vibrates in the usual way 
in the z-direction (perpendicular to the surface) and also in the indirection 
(perpendicular to the direction of passage from one site to the other, i.e., 
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perpendicular to |), but not in the |-direction. We denote the z and i\ 
vibrational partition functions at the activated state by q* and q*. 

The partition function for a molecule in the element of length d£ at the 
top of a barrier is [see Eq. (11-5)] 

qtqt d* e~^+ v <‘ ilkT . ( 11 - 12 ) 

Let N' d£ be the equilibrium number of molecules in a length d£ at the 
top of barriers, and let N be the equilibrium number of molecules in sites. 
Then, from Eq. (10-5), the ratio of these two numbers is 

N'dj = qiqf(2tirmkT/h 2 ) lli d£ e~ v ° lkT ( 

N q x q v q t M ’ ' 

where M*/M is the ratio of the number of activated states to the number 
of sites (this ratio is two for a square lattice). By the same argument as 
in the preceding section (Eq. 11-8), 

""(as?)"’(!) < n - 14 > 


is the number of molecules crossing a barrier (or the number of jumps 
being made from one site to another) per unit time, where N' is given by 
Eq. (11-13). We are assuming here that there are no “rebounds”: k = 1. 
If t is the mean time a molecule spends at a site between jumps, then a 
second expression for the number of jumps occurring in unit time is N/t. 
If we set N/t equal to (11-14), we find 


1 = kT M+q*q?e- Vo,kT 
t h Mq x qtf, 


(11-15) 


This is the analog of Eq. (11-9). That is, 1 /t is the rate constant for this 
process. 

To obtain an estimate of the order of magnitude of r in Eq. (11-15), 
we set M*/M = 2, q* = q v [as would be the case with Eq. (9-19)], 
qf — q„ and q x = kT/hv x (classical). Then 

- = 2v x e~ r ° lkT . (11-16) 

T 


This equation has the following approximate interpretation: 2v x is the 
number of “attempts” per second the molecule makes to leave its rate; 
e -r„ikT |g t jj e probability that any particular attempt will be successful; 
and hence 1/r is the actual number of jumps a molecule makes from one 
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site to another per second. If we take v s — 3 X 10 u sec P 0 = 600 
cal-mole -1 , and T = 80°K, then e~ 7 * lkT = 0.043 and r = 3.9 X HT 11 
sec. 

Equation (11-15) also provides a theoretical equation for the coeffi¬ 
cient of surface diffusion D, since D is related to r (from the theory of 
random walks) by D = Ca 2 /r, where a is the distance between nearest- 
neighbor sites and C is a constant of order unity which depends on the 
lattice type. 


Problems 

11-1. Write out the explicit forms for the partition functions in Eq. (11-9), 
assuming the triatomic complex (ABC)* is linear. Insert typical orders of 
magnitudes for the masses, frequencies, bond distances, etc., to estimate a 
magnitude for k. 

11-2. Derive an equation for d In k/dT from k in Problem 11-1. 

11-3. Discuss the rate of diffusion of impurity atoms in a monatomic crystal 
from the point of view of Eyring’s theory. 

11-4. Consider the rate of evaporation of a dilute localized monatomic 
monolayer into the gas phase, (a) Use Eyring’s method to derive an equation 
for l/r„ where r, is the mean time a molecule spends on the surface before 
evaporating, (b) Derive the same expression for 1/r, by equating the number of 
molecules condensing on the surface per unit area and per unit time with the 
number evaporating, at equilibrium: 

V (N/a) 

(2wmkT)U* “ r, ’ 

where p is the equilibrium gas pressure and JV/Ct is given by Eq. (7-10) in the 
limit as 0 —> 0. 


Supplementary Reading 
Fowler and Guggenheim, Chapter 12. 

Frenkel, J., Kinetic Theory of Liquide. New York: Dover, 1955. 
Glasstone, S., Laidler, K. J., and Eyeing, H., The Theory of Bate Processes. 
New York: McGraw-Hill, 1941. 
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CHAPTER 12 

IDEAL GAS IN AN ELECTRIC FIELD 

Our principal object in this chapter is to deduce the thermodynamic 
properties of a very dilute gas in an electric field. In Chapter 15, this study 
will be extended briefly to slightly imperfect gases. 

In Section 12-1 we give some necessary thermodynamic background, 
and in Section 12-2 we develop general statistical-mechanical equations 
(canonical ensemble). The material in the first two sections is quite 
general and would apply to any fluid (or isotropic) dielectric. The dilute- 
gas special case is then considered in Section 12-3. Finally, in Section 
12-4, we discuss a somewhat related problem: a lattice of noninteracting 
magnetic dipoles in a magnetic field. This problem turns out to be formally 
the same as that of the ideal lattice gas in Chapter 7. The interacting 
magnetic dipole case (the Ising model for ferromagnetism) is included 
in Chapter 14. 

12-1 Thermodynamic background. A number of alternative and equiv¬ 
alent thermodynamic formulations can be devised for a dielectric fluid in 
an electric field. Koenig* has given a very full discussion of this subject. 
The corresponding treatment for magnetic systems is contained in a paper 
by Guggenheim, f We confine ourselves here to the one particular formula¬ 
tion that is most convenient in the statistical mechanics of gases in an 
electric field. For condensed systems, there are some advantages to other 
choices. 

Consider the parallel plate condenser in Fig. 12-1. The plate surface 
charge densities are -Hr and —<r, as indicated. The condenser is assumed 
to have a large enough plate area so that edge effects can be ignored. The 
volume V contains the dielectric fluid whose properties we are interested 
in. For simplicity, we take the fluid as one component with N molecules, 
but it could as well be multicomponent. The same is true in Section 12-2. 
One wall of the fluid container, parallel to the condenser plates, serves as a 
piston to vary the volume V. The equilibrium pressure on the piston is p. 
The regions between the fluid container and the condenser plates are 
evacuated. As a consequence of polarization of the dielectric in the field 
of the condenser plates, there are induced surface-charge densities —o' 


* F. 0. Koenig, /. Phys. Chem. 41, 597 (1937). 
t E. A. Guggenheim, Proc. Roy. Soc. 155A, 49, 70 (1930). 
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Fig. 12-1. Dielectric fluid in volume V between two condenser plates. V 
can be varied by a piston with pressure p. 

and -Hr' on the inside surfaces of the indicated walls (Fig. 12-1) of the 
fluid container. 

Let us consider the electric field acting at any point in the fluid. The 
field due to the condenser surface charge -Hr is 2ir<r, as can easily be de¬ 
duced from Coulomb’s law (Problem 12-1). The field arising from the 
surface charge —«r is also 2x<r (acting in the same direction). The total 
field from the charge on the condenser plates is therefore 4ir<r. This field 
is called the dielectric displacement D; D = 4x«r. We regard D as a 
variable intensive parameter on which the thermodynamic properties of 
the fluid depends. D is an external field; that is, it can be controlled from 
outside the thermodynamic system (fluid) itself. At a point in V, D has 
the same value whether or not a dielectric fills V; that is, it is determined 
solely by <r. In addition to the field D due to external charges acting at the 
point in V, there is clearly also a field —4w<r' due to charges induced on 
the surfaces of the dielectric. An equivalent point of view is that this 
second field arises from molecular dipoles throughout V oriented some¬ 
what in the external field D. It is customary to use the symbol P (polar¬ 
ization) in place of <f. Then the total electric field acting at a point in V 
is D — 4 tP. This field is denoted by 8 (electric field strength): 

8 = D - 4 vP, D = 8 + 4tP. (12-1) 

8, D, and P all have the same sign. The dielectric constant t is then 
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defined by D = eg; necessarily e > 1. In this chapter we shall avoid the 
use of the symbol e as a molecular energy level. 

We now consider the thermodynamic functions of the fluid molecules. 
In these functions we include only contributions from the molecules 
themselves, and exclude contributions that would be associated with the 
space V and field D if no molecules were present in V. This is a natural 
choice in a molecular theory. Thus, the energy E includes the molecular 
kinetic energy, the potential energy of interaction of the molecules with 
the external field D, and the potential energy of interaction between the 
molecules themselves. 

In the expression 

dE = DQ* - DW + n dN, (12-2) 

there are two contributions to DW. One is the usual p dV term associated 
with a volume change (N and D constant, DQ* = 0), and the other is 
related to changes in D (N and V constant, DQ* = 0). Consider the work 
that must be done on the entire system shown in Fig. 12-1 if <r is to be 
changed by da. Let Ct be the cross-sectional area of a condenser plate. 
Then we have to transport an amount of positive charge G da from the 
negative condenser plate to the positive plate. This charge has to be 
moved against a field 6 through the distance L (fluid) and against a field D 
through the distance V — L (vacuum). The total work done on the 
system is thus 

SLG da + D(L' - L)a da. (12-3) 

But to get the quantity we want, the work done on the fluid, we have to 
subtract from (12-3) the work which would have to be done to increase a 
by da when the volume V is evacuated. This work is 

DL'a da. (12-4) 

On subtracting (12-4) from (12-3), we get 

&LQ,da - DLada = -PVdD. (12-5) 

This is work done on the fluid. The desired contribution to DW is +PF dD, 
since DW is work done by the fluid. 

We note that PV = (u'G)L, which is just the total dipole moment 
(charge X separation distance) of the fluid in V. We call the total moment 
M and replace PV by M. Finally, then, Eq. (12-2) becomes 

dE= TdS - pdV - MdD + ydN. (12-6) 

Koenig has shown that the pressure p in this equation has the operational 
significance implied by the particular kind of volume change indicated in 
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Fig. 12-1. On integrating Eq. (12-6), holding intensive properties con¬ 
stant, we find 

E = TS - pV + nN, 

(12-7) 

F = nN = A + pV. 

Alternative useful forms of Eq. (12-6) are 

dA = -SdT - pdV - MdD + ndN, (12-8) 

d(pV) = SdT + pdV + MdD + N dp. (12-9) 

12-2 Statistical-mechanical background. We are concerned with a 
fluid system of N molecules in a volume V. The system is placed in the 
external electric field D (Fig. 12-1). We regard D as a parameter. The 
possible energy states of the system, when D has a particular value, must 
be found from quantum mechanics and are denoted by Ej(N, V, D). 
The argument up to Eq. (1-13) in Chapter 1 is unchanged, but in 
Eq. (1-13) we now write (N constant) 

Just as we defined p, as the pressure in state Ej (Eq. 1-7), here we define 



as the total moment of the system in state Ej. That is, —Mj dD is the 
total work that has to be done on the system, when in the state Ej, in 
order to increase D by dD. Then Eqs. (1-15) and (1-16) become 

TdS = dH + fdV + 31 1 dD, (12-10) 

where 317 is the ensemble average of the mechanical variable Mj: 


37 = £ PjMj. (12-11) 

i 

Equations (1-29) through (1-31) are still obtained, for any assigned 
value of D, and we have 

Q(N, V, T,D) = 'E e - E i iN ’ v ' m,kT , (12-12) 

i 

A(N, V, T, D) - hT\nQ(N, V, T, D). 


(12-13) 
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Thus the connection between thermodynamics and statistical mechanics 
is established. In particular, from Eqs. (12-8), (12-12), and (12-13), we 
find 




') = If* 

/ n.v.t j 


(12-14) 


In an open system (independent variables V, T, n, D), the grand parti¬ 
tion function is defined as 

%(V, T, /i, D) = 2 Qn(V, T, D)\”, (12-15) 

N 


where X = e l ‘ lkT and Qn(V, T, D) = Q(N, V, T, D). We can easily verify 
from the rule (1-92) and Eq. (12-7) that S = e pV!kT here, as in the 
absence of a field D. Equations (12-9) and (12-15) then provide us with 
all the necessary interconnections between S and thermodynamics. For 
example, 


73 = kT 


' d In g\ 
k dD Jt.v ,ji 


(12-16) 


&Qn\ . n _ YLn MnQn\ n 
S Zjf \dDj v ,T ~ EnQnX* ’ 


(12-17) 


where Mn(V, T, D) is new notation for the canonical ensemble average 
73(N, V, T, D) in Eq. (12-14). The average indicated in Eq. (12-16) is 
over N; each Ms in Eq. (12-17) is already averaged over j. 

We shall return to the above equations for an open system when we 
discuss an imperfect gas in an electric field in Section 15-5 (see also 
Problem 12-2). 


12-3 Dilute gas in an electric field. We consider a one-component gas 
which is dilute enough so that each molecule behaves independently and 
classical statistics may be used. Then in the canonical ensemble (Eq. 3-10), 

<MF, T, D ) = , (12-18) 

where Qi here replaces the symbol q used earlier (for example, in Chap¬ 
ters 3, 4, 8, and 9). Equations (12-8), (12-13), and (12-18) then give for 
the total moment of the system, 

"“O'. T ■ °) - tT (rm !i \. r .r - mT (^)y.r 

= NMiiD, T), (12-19) 
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where M j is the canonical ensemble average dipole moment of a single 
molecule at temperature T and in the external field D. The chemical 
potential is 

»~- kT (rsir) D . rr -- iT '“W’ >-&• < 12 - 20 > 

The equation of state will be deduced below. 

The problem is reduced by the above equations to one of finding the 
partition function Q\{V, T, D) of one molecule in a volume V, external 
field D, and temperature T. We consider, therefore, a single diatomic or 
polyatomic (without internal rotation) molecule, in a box of volume V, 
and in an electric field D. Suppose the molecule has a permanent dipole 
moment ju 0 - Also, we assume that the field induces in the molecule a further, 
additive, moment aD (in the direction of the field), where a is the polariz¬ 
ability. The polarizability is a purely quantum-mechanical quantity, 
independent of T. The work necessary to create this induced moment is 
[compare Eq. (12-6)] 

- f\ Di D - 

Let 6 be the angle between the direction of the field and the axis of the 
permanent moment /to (Fig. 12-2a). In the presence of the field, the 
rotation of the molecule is not free, because of the dependence on 0 of the 
potential energy of the permanent dipole (Fig. 12-2b): 

V = —poD cos 9. 


[This problem should be compared with those associated with Eqs. (9-18) 
and (9-19).] The total electrostatic potential energy XJ\ of a molecule in 
the field D is then 



Fig. 12-2. (a) Permanent dipole mo oriented at an angle 6 with respect to 
the field D. (b) Potential energy of dipole as a function of 6. 
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We assume that U x simply makes an additive contribution to the 
Hamiltonian of the molecule. Let us call U j the rotational potential 
energy. The rotational Hamiltonians of Chapters 8 and 9 contain kinetic 
energy terms only. We treat rotation classically, as in these earlier chapters. 
Let <p be an azimuthal angle around the direction of the field D in Fig. 
12-2(a). In the absence of a field, all orientations (0, <p) of the axis of the 
moment no are equally probable. Therefore the probability of finding an 
orientation in the range dd d<p is proportional to sin 0 dd d<p. In the pres¬ 
ence of a field, some orientations are favored over others because of 
Eq. (12-21). The partition function Qi{D) in the presence of a field is 
altered from that in the absence of a field, Qi(0), only through the rota¬ 
tional potential energy U\. Thus we can easily take care of the effect of 
the field by extracting from Qi(0) the result, 4ir, of integrating over 0 and 
<p with U\ = 0, and then include an integration over 0 and <p in Qi(D): 

/•2v /•* 

Q 1 (£>) = 0l@l / e~ u,lkT sin 0 dd d<p. (12-22) 

wc Jo Jo 


In the notation of Chapters 8 and 9, 

Qi(0) = (12-23) 

where q t , q r , etc., are unchanged from our earlier treatment. 

We digress to note at this point that the equation of state of the gas is 


* - iT - mT - nr' 

We substitute Eq. (12-21) in Eq. (12-22) and find easily 


Qi(D) = Q,(0)e' 


iD*i 2 kT sinh y 


(12-25) 



v 

Fio. 12-3. Langevin function, £(y) = coth y — y~ l . 
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where y = n 0 D/kT. Then, from Eq. (12-19), 

Mi = aD + mo £(y), (12-26) 

where £(y) is the Langevin function (Fig. 12-3): 

£(y) = coth y — i • (12-27) 

In practice, y « 1. For example, if Mo = 1 Debye, D — 1000 volts-cm -1 , 
and T = 300°K, y = 0(10 -4 ). ITie expansion of £{y) about y — 0 is 
(Problem 12-3) 

Hy) = |-|§+*”- (12-28) 

Ordinarily only the first term is required. Similarly, 

ainhy _ , i vt . 

y 6 ^ '• 

Equations (12-25) and (12-26) become then 

«i<0) = O.(0)e-" , ' !CT [l + i (^)’] ■ (12-29) 


Here M\ is the average dipole moment of one molecule; aD is the induced 
moment (independent of T), and y\D/$kT e fi(T) is the statistically 
averaged component (in the direction of the field) of the rotating perma¬ 
nent moment mo- K the permanent moment were completely oriented by 
the field [D —* oo or T —* 0; £{y) —* 1], p would approach mo itself. 
That is, Amu = Mo- It is clear that at ordinary field strengths and tem¬ 
peratures the rotation is almost free and the orientation of mo in the 
field is very slight, for 


A _ MoD 
Amax 3 kT 


« 1 . 


In the limit as T —» oo or D —* 0, A —> 0. 
From Eq. (12-1), 


D = 


8 + 4irP = ~ + 


4irAf 
V ’ 


or 


e — 1 4 irM 

t ~ DV' 


(12-31) 
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In the limit as p = N/V —» 0, we then have the following equation for 
the dielectric constant e of the gas: 


e-l = 


4wiWifV 

DV 



(12-32) 


A plot of the experimental quantity (e — l)/4irpc against 1/T will give a 
as the intercept and pjj/3 k as the slope. This well-known method of 
determining permanent dipole moments is treated in physical chemistry 
texts and will not be considered further here. 

Finally, let us deduce some of the thermodynamic functions for a dilute 
gas in an electric field. First, from Eqs. (12-20) and (12-29), 


M m(0) 
kT kT 



(12-33) 


where p(0) is the value of p when D = 0 (see Chapters 8 and 9). Next, 
from Eqs. (12-8), (12-18), and (12-29), we find for the entropy 


S _ S( 0) 1 (fi 0 p\ 2 
Nk~ Nk 6\kT ) ' 


(12-34) 


The entropy is reduced in the presence of a field because rotation is slightly 
hindered. Also, for the energy, 


E _ S 
NkT ~ Nk 


1 i n ^(0) 
jylnQ mT 


kT (2 + 3 fcr) * (12_35) 


The added terms in Eq. (12-35), with the field on, are clearly just the 
average value of Ui/kT (Problem 12-4). 

The terms in D 2 in Eqs. (12-33) through (12-35) are very small, say of 
order 10~® to 10 -8 in typical cases. The leading term in each of these 
equations is of order unity. 


12-4 Lattice of noninteracting magnetic dipoles. In this section we 
discuss an idealized magnetic problem. Suppose we have a lattice of M 
equivalent magnetic dipoles (associated, say, with electron or nuclear 
spins), each of which can exist in only two orientations or states: f, in 
the direction of the magnetic field H; or J, against the field. The potential 
energy of a dipole or spin is —mH if oriented with the field (|), and +mH 
if oriented against the field (i), where m is the magnetic moment. The 
dipoles are assumed not to interact with each other (or, more accurately, 
only “weak” interaction is allowed—see Chapter 3); each dipole behaves 
independently of the rest. This is a good model for nuclear spin systems 
but not for electron spin systems (e.g., ferromagnetism). In our discussion 



210 


IDEAL GAS IN AN ELECTRIC FIELD 


[CHAP. 12 


of ferromagnetism in Chapter 14 we shall introduce nearest-neighbor 
interactions. 

Let N be the number of i states and M — N the number of f states. 
For a given value of N, the total potential energy of the dipoles in the 
field is 

mHN - mH(M - N) = (2AT - M)mH. 

For given 217, H, and T, let 77 be the average value of N (N can range 
from 0 to M). Then the work necessary to increase H by dH is —7 dH, 
where 

I — (M — 2 77)m. (12-36) 


In Eq. (12-36), 1 is the intensity of magnetization; it is the average excess 
number of f states over i states, multiplied by m. In the absence of a 
magnetic field, the two states have the same potential energy (zero), and 
therefore occur in equal numbers; hence 7 = 0 when 77 = 0. 

The basic thermodynamic equations we use are* 

dE = TdS - IdH+ndM, (12-37) 

dA = -SdT - IdH + ixdM. (12-38) 

On integrating (77 is intensive, 7 extensive), 

A = E - TS = nM. (12-39) 


For a given value of N, there are 217 !/2V!(M — N )! possible arrangements 
of the two states f and 1 among the M positions. Also, for given N, 
the energy of the dipoles in the field H is (2 N — M)mH, as already men¬ 
tioned. Therefore, the canonical ensemble partition function for a system 
of M dipoles at T and H is 


Q(M,T,H) = q« 


^ M\e< u - 2!f)mBlkT 
fa Nl(M — N)\ ’ 


(12-40) 


where q(T) is the “internal” partition function of a dipole—a quantity we 
need not specify further. The summation in Eq. (12-40) is easy to carry 
out. We find 

Q(M, T, H ) = q«e MmaikT (l + e ~ 2mr!lkT ) M 

= (2q cosh • (12-41) 

* In S. M., pp. 288-289, the quantity E m is equivalent to E + HI here, and 
the partition function A is equivalent to Q here. 
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Fra. 12-4. Magnetization (I/mM) of lattice of noninteracting magnetic 
dipoles as a function of magnetic field ( mH/kT ). 


From A = —kTlaQ, and Eqs. (12-38) and (12-41), we find for the 
magnetization 7, 

7 = (Af - 237)m = kT (^J/f) MT = mMtanh^- (12-42) 


This function is plotted in Fig. 12-4. When H = 0, 7 = 0 (i.e., 37 = M/2). 
When H —> +co, all dipoles are oriented t > and 7 — mM (i.e., 37 = 0). 
When H —> —oo, 7 = —mH and 37 = M. 

The formal resemblance between this model and Langmuir adsorption 
(or an ideal lattice gas) is rather obvious. See also Section 7-4 on the 
elasticity of a polymer chain. The two states f and J. correspond to an 
empty site and an occupied site, respectively. If we denote the fraction of 
dipoles in the state i by 8, then 


7 

mM 


= 1—2 8 = tanh 


mH 
kT ' 


(12-43) 


The Langmuir adsorption isotherm, 8 = gX/(l + q\) (Eq. 7-16), can be 
put in the same form: 

1—2 8 = ^ ^ = tanh x, (12-44) 

where 

a; = - In (q\) 112 . 

In the magnetic, adsorption, and polymer elasticity problems, the magnetic 
field, chemical potential, and force, respectively, play corresponding roles 
in shifting the relative population of the two states involved in each case. 
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Problems 


12-1. Use Coulomb’s law and an integration to show that the (normal) 
electric field of an infinite plane sheet of charge, with charge density <r, is 2 ro. 
(Page 202.) 

12-2. Deduce the equations pV — 7JkT, X — TJ/Qi, and M <= TJMi from 
Eqs. (12-9), (12-15), and (12-18). (Page 205.) 

12-3. Show that 


£(V) 



(Page 208.) 

12-4. Show that 


Ut 


£ Uie~ Vt ' kT sin 6d8 
f\- u ' lkT Bin 6 dS 


(Page 209.) 

12-5. Show that 



+ •••• 


f po cos 6 e’‘ oD * 0 ** ,tr sin BdB 2 

a - h _ 

C ^ Da ^ tkT Bin 9 d6 3kT 

Jo 

12-6. If mo D 2> kT, a gas molecule will oscillate about 9 — 0 instead of 
rotating. In this case it is appropriate to express p as a power series in T instead 
of T-K Find the first two terms in the series. 

12-7. Obtain the basic thermodynamic functions for a sample of an ideal gas 
at a height A in a gravitational field. Take the gravitational potential energy es 
zero at h = 0. Find the density of the gas as a function of h, by use of the 
equilibrium condition p(h) = p(0), where p = chemical potential. 

12-8. Calculate the value of y = mo D/kT for mo “ 1 Debye, D — 1000 
volts-cm -1 , and T = 300°K. 

12-9. Calculate e — 1 from Eq. (12-32) for H 2 O vapor at 100°C and 1 atm 
pressure. Take a = 1.68 X 10 ~ 2i cm 3 and mo = 1.84 Debye. 

12-10. Derive an expression for the equilibrium ratio of the number of 
| dipoles to the number of 1 dipoles, for given H and T. Show that if suddenly 
the field is switched from H to — H, and if the distribution of dipoles between 
t and | remains unchanged an appreciable time before adjusting to the new 
direction of the field, then during the time before adjustment the system has, 
in effect, a negative temperature. This is a useful point of view in connection 
with some nuclear spin systems. 

12-11. Derive equations for S/Mk and E/MmH as functions of kT/mH 
(take g = 1). 

12-12. Investigate fluctuations in N and / in the magnetic system of Sec¬ 
tion 12-4. 
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CHAPTER 13 

CONFIGURATION OF POLYMER MOLECULES AND 
RUBBER ELASTICITY 


In this chapter we present a brief introduction to polymer configuration 
problems and to the theory of rubber elasticity. The treatment is based 
to a considerable extent on the work of James and Guth (Supplementary 
Reading list). A number of other topics having to do primarily with 
polymer molecules in solution will be discussed in Chapter 21. 

The basic prototype for polymer chains can be represented as 

R R R R R 

/\/\/\/\/\ 

R R R R 


where R is the monomer unit, and the chain continues indefinitely at both 
ends. The configuration shown above is the fully extended configuration. 
Actually, because of rotation of the attached groups of R’b around each 
R—R bond, a great many configurations are possible, of which the ex¬ 
tended configuration is only one. One of the fundamental problems in 
polymer statistics is to deduce the relative number of configurations of a 
long polymer chain consistent with a specified end-to-end distance (Fig. 
13-1). This problem is closely related to problems in brownian motion, 
random walks, diffusion, etc. 



Fiq. 13-1. Long polymer chain with end-to-end vector r. 
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One of the simplest polymer chains, polymethylene, has R = CH 2 . 
Other well-known possibilities for R are: 


C«H 5 

—CH 2 —CH— 

Polystyrene 

In typical cases the number of monomers in a chain might be from 100 to 

10 , 000 . 

13-1 Freely jointed chain. To handle this problem we first have to 
generalize the formalism of Section 7-4. Consider a linear polymer chain 
made up of M units, where M is large enough so that one chain can be 
considered a thermodynamic system. Each unit can exist in the states 
i = 1, 2, ..., » with partition functions ji(T) and lengths i,. The total 
length of the chain is l. The system (chain) is characterized thermo¬ 
dynamically by l, M, T. The canonical ensemble partition function is then 

Qd,M,D = (13-1) 

11 i—1 ” »' 

where Mi is the number of units with length U, and the sum is over all 
sets M = Mi, M a ,..., M n consistent with the restrictions 

2 M < = M > (13-2) 

»-i 

]£ UM { = l (13-3) 

i—1 

Equation (13-1) is a rather obvious generalization of Eq. (7-58). Here, 
for purposes of symmetry, we choose l as independent variable instead of 
one of the M, [M a was used in Eq. (7-58)]. The appropriate thermody¬ 
namic equation is 

dA= -SdT + rdl + ndM, (13-4) 

with 

A = —kT]nQ (13-5) 

and r = force pulling on the chain. 

The restriction (13-3) is troublesome; to avoid it we change to another 
partition function. We use the partition function 

d(r, M,T) — Y,Q«,M,De rtlkT 

i 


CH 3 

I 

—ch 2 —c=ch—ch 2 — 

Rubber 


(13-6) 
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This is the analog of Eq. (1-87). The connection with thermodynamics is 


dF = -SdT - Idr + ndM, (13-7) 

F = A — rl = fiM = —kT In A. (13-8) 


We substitute Eqs. (13-1) and (13-3) in Eq. (13-6) and obtain 


a(t,m,d = E M, n 

M t-1 


U<e"« kT ) u < 

Mil 


where now the only restriction on sets M is (13-2). The sum can be car¬ 
ried out immediately, and we have 


A(r, M, T) = (£ jS**Y m £(r, T) u , 


(13-9) 


This gives, for example, for the average length I of the chain at a given 
force r, 

l - - (Z) - H-(^) = 

\9t/m,t \ dr / u,t \ Br /t 


_ ZilQd, M, De Tlltr „ Zilj,* TliliT 
" "LiQdyM, De r,lkT ~ ' 


(13-10) 


Equations (13-1) through (13-10) are formally the same as Eqs. (7-23) 
through (7-29). Therefore the notation A = £ u in Eq. (13-9) is appro¬ 
priate. The partition function £ for one unit has the same form as A in 
Eq. (13-6) for the entire chain [just as £ in Eq. (7-27) resembles S in 
Eq. (7-25)]. 

We now consider a special case, a chain of M units, each of length a, 
with “free” joints between units. That is, if we choose one end of any 
unit as origin, the other end of the unit moves freely (in the absence of a 
force on the chain) over the surface of a sphere with radius a (Fig. 13-2). 
The ends of the chain are a distance l apart and are on the x-axis. If the 
left end of the chain is considered fixed, we want to calculate, among other 
things, the equilibrium force r along the x-axis necessary to hold the chain 
extended a distance l (Fig. 13-2). Real polymer chains do not have-free 
joints between monomers (R units), but an approximate connection can be 
established between real chains and this idealized model (see Section 13-2). 

The contribution of any one unit to l can range from —a to +o. Thus 
li in Eq. (13-9) can vary continuously between these limits. We use x for 
this continuous variable. It is clear from Eq. (13-10) that ji is proportional 
to the probability of a length U being observed when there is no force on 
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Fio. 13-2. Two-dimensional version of freely jointed chain. Each unit is of 
length a. The ends are on the x-axis and are a distance l apart. 

the chain (r = 0). When there is a force, jit Tl * lkT is proportional to this 
probability. In the present problem, then, we let j(x, T)dx be propor¬ 
tional to the probability that the end of a freely moving unit (Fig. 13-2) 
will have an x-component in dx. It is easy to see (Problem 13-1) that 
this probability is in fact independent of x (in the range — o < x < +a). 
Therefore we have 

( = j(T)e TXlkT dx 

= ^sinhf, (13-11) 

where t — ra/kT. From Eq. (13-10) we find for the length-force relation 

l = MkT (~~) t = Ma£(t), (13-12) 

or 

where £ is the Langevin function defined in Eq. (12-30) and £ -1 is the 
inverse Langevin function. The maximum extension is Ma; to achieve 
this we need t — ► oo. Figure 12-3 provides us with a plot of l/Ma (ordinate) 
against t (abscissa). 

The occurrence of the same (Langevin) function here and in Section 
12-3 is not surprising. In Section 12-3 we were dealing with freely ro¬ 
tating dipoles perturbed in their rotation by an electric field. Here we 
have freely rotating units of a chain perturbed in their rotation by a force 
pulling on the chain. 
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At small extensions (Z « Ma), we use £(t) = t/3 and find the linear 
relationship 



or 


_ 3 kTl 
T Mo 2 ' 


(13-14) 


Thus Z = 0 when t = 0, which is what we should expect on symmetry 
grounds (Z can be positive or negative; with r > Owe have Z > 0). 

The question of fluctuations is of some interest. For example, when 
r = 0 does the value of Z fluctuate much about Z = 0? By the methods 
of Section 2-1 we find (Problem 13-2) from Eq. (13-10) for the fluctuation 
in Z at fixed r, 

(£)"'■ (13-15) 

From Fig. 12-3, we see that dl/dr is largest at small extensions. In the 
limit as r —» 0 (Eq. 13-14), 


Zq _£?_1_ 

(Ma) 2 “ (Mo) 2 ~ 3M ’ 


(13-16) 


where we compare the fluctuation Z| with the maximum extension Ma in¬ 
stead of Z, since Z —» 0 as r —» 0. The subscript on Z§ refers to r = 0. 
Thus, if M = 3000, «rj /Ma = 10 -2 , a rather significant fluctuation. 
The conclusion we draw from this is that chains of this size are not quite 
macroscopic in the thermodynamic sense. Hence, although quantities such 
as r, Z, Z 2 , T, etc., are well defined (see Sections 3-4 and 7-2) and equations 
of the type (13-10), (13-13), and (13-15) are valid, functions such as E, 
F, S, A, etc., for a single chain are slightly fuzzy in their thermodynamic 
significance. These latter functions become sharply defined, of course, for 
a system consisting of a large number of chains (as is always the case in 
practice). _ 

From Eq. (13-16) we have that Zg = Mo 2 /3. Then Eq. (13-14) can 
be written 



Actually, the length-force equation in this form is applicable to small ex¬ 
tensions of any kind of chain with l 0 = 0. This relation follows directly 
from Eq. (13-15) and the necessary (by symmetry) linear dependence of 
r on Z at small Z. 

The generalization of Eq. (13-17) to Zo ^ 0 (as, for example, in Section 
7-4) and extensions that are not small can be deduced by a method which 
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has very wide applicability in statistical mechanics. Define A 0 from Eq. 
(13-6) by 

A 0 (M, T) m A(0, M,T) = '£ Q<!> M > r )> 

1 

and rewrite Eq. (13-10) as 

l -mX (I3 - ,8> 

Now if we expand the exponential in Eq. (13-6), we find 

A_ _ TTi-o (l/fl) EtW, M, T)(Tl/kT)‘ _ 1 (r_\*J 

A 0 ~ EiQd, M, T) - £ i\ \JctJ l0> 

where lo, a function of M and T in general, is the average value of f* at 
zero force. This follows from Eq. (13-10), which states that the prob¬ 
ability of the chain having a length l is proportional to Qe n/kT when the 
force is t and to Q when 7 = 0 [see also Eq. (2-13)]. The occurrence of 
“unperturbed” (r = 0) averages is the essential point here, since these 
are not so difficult to calculate. Equation (13-18) becomes, then, 





Eto (lAO (r/kTyi' 0 


(13-19) 


This gives l — l 0 as a power series in r/kT, or vice versa. We shall not 
pursue this method further here (but see Problem 13-3). 

As just indicated, Q(l, M, T) is proportional to the probability that the 
free (r = 0) chain has a length l (for given M and T). An equivalent 
statement is that Q(l, M, T) is proportional to the number of configurations 
(a configurational degeneracy) the chain can assume consistent with a 
length /, for given M and T. The dependence of this probability on l is 
perhaps the most important single property of a polymer chain. We 
are now in a position to deduce this dependence for a freely jointed chain 
(and more general chains) from Q. The same results can be obtained from 
the theory of random walks, without use of any of our statistical thermo¬ 
dynamical formalism. 

The general method we employ is to integrate the length-force relation 
to obtain A and hence Q, using Eqs. (13-4) and (13-5). It should be noted 
that at this point we make use of macroscopic thermodynamics; hence in 
the following we are dealing implicitly with the limit of very long chains 
(M —> oo). For the freely jointed chain, 

dA = t dl = £~ l dl (T, M constant), 
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and therefore 


Qd, M, T) 
<2(0, M, T) 



(13-20) 


This is the probability of a free chain having a length l relative to the 
probability of a length l — 0. It is also the ratio of the number of con¬ 
figurations of the chain with length l to the number with length zero. If 
we use the expansion (Problem 13-4) 


£-'(x) = 3* + fz 8 + 


(13-21) 


which can be deduced from Eq. (12-31), Eq. (13-20) becomes 

on* 

When l <K Ma, we keep just the first term in this expansion and obtain 
the gaussian probability distribution ordinarily used, 


Q(l, M, T) _ 
Q(0,M,T)~ e 


(13-23) 


This equation also follows directly on integrating the linear length-force 
equation, (13-14). Thus the gaussian probability distribution for the 
length of a free chain and the linear length-force relation for a chain under 
an extending force have the same limits of validity (Problem 13-5). To 
go beyond the linear length-force range, configurations of the chain with 
values of l outside the gaussian region become involved. That the “gaus¬ 
sian region” is in fact quite extensive can be seen as follows. The ratio of 
the correction term in Eq. (13-22) to the gaussian term is (3/10)(f/Afa) 2 . 
Even for a very large extension, this quantity is small compared with 
unity. For example, take M — 1000 and an extension V of ten times the 
root mean-square extension {Ma 2 / 3) 1/2 . Then 


3 ( V V _ 10_1_ 

10 XMa) “ M~ 100' 


We have been emphasizing the probability significance of Q and Qe TlliT 
for the length of a chain with fixed force. But one must also keep in mind 
that Q has the usual connections with the thermodynamic properties of a 
chain with fixed length (fluctuating force). An example is the deduction of 
the linear l — r relation from Q(I) in Eq. (13-23) (Problem 13-6). An¬ 
other example is the derivation of an equation for the dependence of the 
entropy S of a chain on its length l. In the present model, Q(l, M, T) has 
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the functional form (Eq. 13-20) 

<3(1, M,T) = <3(0, M, T)f{l, M). (13-24) 

This relation, combined with Eq. (1-33) for S, leads immediately to 


S(l, M,T) - S(0,M,T) = fc In/ = fc In [Q/Q(0)] 



(13-25) 


(13-26) 

The entropy is a maximum (largest number of configurations) at 2 = 0 
and decreases with increasing 2. The right side of Eq. (13-25) approaches 
— oo when 2 —» Ma. This, however, is pushing the model too far: a real 
polymer molecule, when fully extended, will not be rigid, but will have 
internal vibrational motion. The analog of this situation for an ideal gas 
is letting V —> 0 in Eq. (4-20). 

It is possible to write Eq. (13-20) in an alternative and more explicit 
form. Thus, from Eqs. (13-11) and (13-13), 


Q _ e ~ AlkT = A e- TllkT 




Here again we should note that the limit M —» oo is implicit, since we 
have made use of the thermodynamic equivalence of the partition func¬ 
tions Q and A. Equation (13-22) may also be obtained from Eq. (13-27) 
(Problem 13-7). 

From Eq. (13-19) we can derive a more general version of Eq. (13-23) 
(or Eq. (13—22)] for any polymer chain (see also Problem 13-3). We 
integrate dA = r dl, where 

r mi - fo) 

II - do ) 2 ’ 


(13-28) 
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and obtain 


gft m, n 
Q(l o, M, T) 



1 

2 $ - <4> , J’ 


(13-29) 


where M is a number proportional to the mass of the polymer molecule. 
Thus a gaussian probability distribution about l =_2 0 = 2o> for small 
extensions, is always found. Since in general 2o and 1% are functions of 
temperature, Eq. (13-26) is somewhat more complicated here. For ex¬ 
ample, if lo(T) = 0, we find (Problem 13-8) 

S<(, If. T) - s(0, = fef) J' <13 - 30) 

We should expect Z|j to increase with temperature for a real molecule 
owing to increased freedom of rotation about chemical bonds in the chain. 

As a final topic in this section, we consider briefly the one-dimensional 
version of a freely jointed chain. The chain has M units, each of length a. 
Each unit must now always lie on the z-axis so that the possible contribu¬ 
tions of a unit to l are the two values —a or +a. Thus the chain resembles 
a folding ruler. In random-walk language, this is a random walk along a 
line with each step of length +o or —a. In Eq. (13-9), we take l x — +o, 


h — —a, and j x — j 2 — j. Then 


A= ( 

f Af 

jycosh w ) 

(13-31) 

and, from Eq. (13-10), 



I = Ma tanh t, 

t = ra/kT, 

(13-32) 

1 = tanh_i Gfc) 

- _L + i(_LV + ... 

Ma ^ 3 \Ma/ ^ 

(13-33) 


Just as the three-dimensional freely jointed chain under a pulling force 
resembles a gas of dipolar molecules oriented by an electric field (Section 
12-3), the one-dimensional freely jointed chain under a force resembles a 
system of magnetic dipoles in a magnetic field (Section 12-4). In particu¬ 
lar, Eqs. (12-45) and (13-32) should be compared. Figure 12-4 is also a 
plot of l/Ma (ordinate) against t (abscissa). 

The present one-dimensional problem is a special case of the model 
discussed at the beginning of Section 7-4 using different independent vari¬ 
ables. The connection in notation is l a = —a, If = +a, and j„ = j t = j. 
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By the same methods as for the three-dimensional case, we find 
(Problem 13-9) 

mj$) = exp [~al tanh_1 (m) *] a3 ~ 34 > 

= cosh" (tanh- 1 exp (- ^ tanh" 1 ^) (13-35) 

-expj—+ - (13-36) 

and = Ma 2 . 


13-2 Gaussian probabilily distribution for free polymer molecules. In 
this section we discuss further the gaussian probability distribution for 
free (r = 0) polymer molecules with the usual property lo = 0. Since the 
whole section is concerned with free chains, we drop the subscript zero on 
ii.it % ®tC. 

We saw in the preceding section that if one end of a long polymer 
molecule is chosen as origin and the other end is forced to lie on a pre¬ 
assigned line passing through the origin, say the x-axis, then according to 
Eq. (13-29), the probability that the ends of the molecule will be separated 
by a distance l is proportional, for l not too large, to exp (—l 2 /2l 2 ). Since 
the direction of the preassigned line is arbitrary, we can make the equiva¬ 
lent alternative statement that if one end of a polymer molecule is chosen 
as origin, the probability that the other end will lie in a specified volume 
element dxdydz, a distance r from the origin (Fig. 13-1), is proportional 
to 

e —r*/2I* fa fly fa 


As a next step, we can conclude that if one end of a polymer molecule is 
chosen as origin, the probability that the other end is at a distance between 
r and r + dr, irrespective of direction, is 


P(r) dr = -* = (2wi 2 )- 8/2 e- ,,/a? 4irr 2 dr. (13-37) 

This probability is normalized to unity. The average values of r 2 and r 
are 


-f 


r 2 P(r) dr = 3J 2 , 


(13-38) 
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Fig. 13-3. Chun with fixed angle 0 between units. In part (b), if the left 
unit is on the 2 -axis, the end of the right unit (point B) can be anywhere on the 
dotted circle. The angle <p is measured from some fixed point A. 





(13-39) 


Thus 1 = 0 but f > 0 (l can be positive or negative, but r is always 
positive). Using Eq. (13-38), Eq. (13-37) takes the more appropriate 
form 

P(r) dr = dr - (13-40) 


Equation (13-40), it will be recalled, follows from the very general 
equation (13-29) and is therefore not restricted to any particular model. 
In various special cases, an explicit expression can be given for r 2 . For 
example, for the freely jointed chain of Section 13-1, l 2 = Ma 2 / 3, and 
hence r 2 = Ma 2 . We now list, without proof,* some further results for 
idealized models of polymer molecules, which, however, are considerably 
more realistic than the freely jointed chain. 

(1) If the chain has M units or bonds of length a, and 0 is the fixed 
bond angle between successive bonds (Fig. 13-3a), and if rotation about 
bonds (see the angle <p in Fig. 13-3b) is free, then for large M, 


Ma 


2 (1 ~ COS 0) 
(1 + COS 0) 


(13-41) 


The tetrahedral angle 0 = 109.5° is the case of most interest: r 2 = 2Afo 2 . 
If 6 = 90°, r 2 = Ma 2 , as for a freely jointed chain. 


* See Flory, pp. 414-422, for more details. 
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(2) If fixed bond angles and 6 2 alternate (e.g., 0—Si—0 and Si—0—Si 
in the silicone chain), then 


1 _ u 2 (1 - cosfli)(l — cos 6 2 ) 
1 (1 — cos 6\ cos flj) 


(13-42) 


for large M. 

(3) Here we have the same situation as in (1) except that rotation about 
<p is hindered (see Section 9-5). For a hindering potential V(<p) which is 
symmetrical about tp = 0, 


^ = Ma 2 (L-_^g)-(l± C08 -*> (13 _43) 

(1 + cos 8)(l — cos <p) 


for large M and cos <p not too near unity, where 

f cos <pe~ Vi,)lkT d<p 

cos ip = —- 

f r e~ VMlkT dip 
Jo 


(13-44) 


If V(ip) s 0 or if F(<p) = V(ip + 2irm -1 ), where m> 2, as in Eq. (9-18), 
then cos^ = 0 and Eq. (13-43) reduces to Eq. (13-41). However, actual 
polymer chains will not have this symmetry, and the cos^> correction will 
be significant. 

Although the models leading to Eqs. (13-41) through (13-43) are 
much more realistic than a freely jointed chain, they still cannot be taken 
too seriously. For example, bending and stretching of bonds have not 
been taken into account. Much more important, van der Waals (or other) 
attractions and, especially, repulsions between different units of the chain 
have been ignored. The neglect of van der Waals repulsions enters all the 
above models with the implicit assumption that the chain has a length 
but no thickness. Because of this complication alone, the polymer con¬ 
figuration problem differs significantly from ordinary random-walk 
problems: in a given polymer configuration, two parts of the chain cannot 
cross each other (occupy the same space), but there is no such restriction 
on random-walk (or diffusion) paths. In polymer language, this is called 
the excluded volume problem, and much recent theoretical work has been 
done on it.* _ 

For the above reasons, detailed theories providing expressions for r* 
in terms of a model are not very practical. Instead, one can regard r 2 


* See, for example, F. T. Wall andJ. J. Erpenbeck, J. Chem. Phya. 30, 634, 
637 (1959). These authors find that r 2 « M*, where 6 = 1.18 for a tetrahedral 
lattice. 
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in Eq. (13-40) as an empirical quantity to be determined by some physical 
property of the polymer molecules that can be related to r 2 . 

An approximate jsemiempirical device that may be used to relate a real 
chain of unknotm r 2 to the simplest model above, the freely jointed chain, 
is the following. Bond angle restrictions exist between one monomer and 
the next in a real polymer molecule. But if we call, say, five or ten (de¬ 
pending on the stiffness of the chain) monomers one “statistical unit," 
then the (end-to-end) direction of one statistical unit is essentially in¬ 
dependent of the direction of neighboring statistical units in the chain. In 
fact, enough monomers are included in a statistical unit to ensure this 
independence. Thus we can replace the actual restricted chain of mono¬ 
mers by an equivalent chain of freely jointed statistical units. If M is 
the number of monomers in the chain and n the number in a statistical 
unit,, then the number of statistical units is M' = M/n. The length of a 
statistical unit, a', is estimated as the root mean-square end-to-end dis¬ 
tance of a statistical unit (i.e., a chain of n monomers). Then, finally, in 
Eq. (13-40) we put _ 

r 2 = Af'o' 2 , (13-45) 

as for a freely jointed chain. The excluded volume problem is ignored 
here. 

If r 2 is known, then M' and o' can_be chosen in a unique way so that not 
only does the product MV* equal r 2 but also so that the fully extended 
length of the effective freely jointed chain, M'a’, is equal to the fully ex¬ 
tended length of the real chain, L..-. , That is, from the equations 



r 2 = M'a' 2 

and 

4n« = M’a’, 


we deduce 

* 

II 

MT” 

and 

II 

fl-* | 

8 1 “ l 

(13-46) 


Again the excluded volume problem is ignored. 

13-3 Rubber elasticity. Rubber consists of an isotropic network of 
long polymer chains. The space-filling property of the chains, referred to 
in Section 13-2 in connection with the excluded volume problem, is im¬ 
portant here, for rubber is a condensed phase with some liquidlike proper¬ 
ties. A rather good analogy to a sample of rubber is a large tightly packed 
collection of very long actively wiggling worms, with each end of each 
worm attached to one end of each of three other worms (to form a network). 
The junctions joining the ends of four chains (worms) together are called 
cross-links. A real network will of course have imperfections (chains with 
free ends, etc.) just as a real crystal has imperfections. 
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It is commonplace that rubber has rather unique elastic behavior. This 
behavior is a consequence of the special configurational properties of 
polymer molecules considered in the preceding sections. We shall give 
here only a very brief and semiphenomenological discussion of rubber 
elasticity. An adequate treatment of the details of polymer network 
theory would take us far beyond the scope of this book. The reader inter¬ 
ested in this subject should see the papers of James and Guth (Supple¬ 
mentary Reading list). The alternative, simpler, but less satisfactory 
theory of rubber elasticity, due to Wall, will be presented in Chapter 21. 
This latter theory provides the starting point for the only existing theories 
of polymer and polyelectrolyte gels, etc. 

Let us begin by summarizing the observed thermodynamic behavior 
of rubber for extensions up to the order of 300%. First, rubber is approxi¬ 
mately incompressible (as are typical liquids); when rubber is stretched, 
the volume stays almost constant. We can therefore use the following 
rather accurate thermodynamic equations for a sample of rubber of definite 
mass (L = length of sample): 


dE = TdS + rdL, 

(13-47) 

dA = -SdT + rdL, 

(13-48) 

7 = (l z\ = Te + Ts ’ 

(13-49) 

(se\ 

Tb = \m,) t ’ 

(13-60) 


(13-51) 


where tb and ts are the energy and entropy contributions to the force r. 
By measuring rasa function of both L and T, ts can be calculated from 
(dr/dT)t, and hence te can be obtained from Eq. (13-49). It is found 
in this way that te is approximately zero: the elasticity of rubber is an 
entropy effect. Thus E depends on T but not on L. The implication of 
this is that when rubber is extended, the intermolecular potential energy 
remains constant, which is not surprising for a condensed phase of con¬ 
stant volume, and also that the extension is made possible by sufficient 
uncoiling of the polymer chains but does not involve any bending or 
stretching of chemical bonds. This behavior is equivalent to that of an 
ideal gas: E is a function of T but not V; and in the equation analogous 
to Eq. (13-49), p = Pb + Ps, Pb = 0. 
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An alternative and equivalent experimental observation is that the force 
r is directly proportional to T at constant L. From the relation 

03 - 62 ) 

we conclude, then, that ( dS/dL)r is a function of L only. This is consistent 
with a split of the entropy into two parts: 

S = 3i(D + S 2 (L), (13-53) 

where <S a (L) is the entropy associated with the configurational degeneracy 
of the polymer chains of the network. Again there is an analogy with an 
ideal gas: (dS/dV)r is a function of V only; S — Si(T) + <S a (F)- For an 
ideal gas, replace, in Eq. (13-52), t by p, Ts by ps, and —(dS/dL )r by 
(dS/dV)r = Nk/V. 

On the basis of the above discussion, we postulate that the essential 
molecular mechanism determining the elasticity of rubber is the elasticity 
of the individual chains making up the network, and this in turn is de¬ 
termined by the configurational properties of the chains (Section 13-2). 
We have to superimpose on this postulate the facts that the volume is 
constant on stretching and that a hydrostatic pressure exists in the rubber, 
just as in any liquid. 

Consider an isotropic cube of rubber, with edge Lo, when under no 
force. The volume is V = L%. Now let a force t extend the rubber in the 
^-direction so that L ■ L x > Lo- Then 

L v = L„ V — L% — LLyL, = LL%. (13-54) 

Let us examine the mechanical equilibrium at a surface of the stretched 
rubber perpendicular to the 2 -axis. There is an outward force pLL y owing 
to the hydrostatic pressure, but this is just balanced by the inward force 
of the molecular chains. We cannot write a satisfactory and completely 
explicit expression for this inward force without a detailed study of the 
properties of the network. However, for the small extensions we are in¬ 
terested in, we can deduce from Eq. (13-17) for a single chain that the 
inward force exerted by a network of N chains will have the form CNkTL t , 
since L, will be proportional to l, for a single chain. Here, C is a constant 
which depends on the structure of the network. On equating the inward 
and outward forces, and putting L, = Ly, we find 


CNkT 

P = — 


(13-55) 
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Fig. 13-4. Comparison of experimental and theoretical length-force rela¬ 
tions for rubber in a typical case. The vertical scale has been adjusted to give 
best fit. 


Next, consider the mechanical equilibrium at the surface (perpendicular 
to the x-axis) which is being pulled by an external force r. Here r + pL\ 
(outward force) is balanced by CNkTL (inward force). Then 


r = CNkTL - pLl 

From Eqs. (13-54) and (13-55) this becomes 

r = CNkTLo (« - ’ 


(13-56) 


(13-57) 


where a = L/L 0 . This is the desired length-force equation, valid for 
small (up to about a = 3) extensions. Of course a = 1 when r = 0. 
The initial slope, (dr/da) a -u is ZCNkTL 0 ■ Experiment and theory 
are compared in Fig. 13-4, where the vertical scale has been adjusted to 
give the best fit. The experimental “knee” is well reproduced by the theory. 
Deviations occur, as expected, at high extensions. 

In the Wall theory of Chapter 21, C = Ljj" 2 . 



230 POLYMER CONFIGURATION AND RUBBER ELASTICITY [CHAP. 13 


Problems 

13-1. Show that the function j(x, T) for a unit in a three-dimensional freely 
jointed chain is independent of x in the range —a < x < + a. (Page 217.) 

13-2. Derive Eq. (13-15) for the fluctuation in length of a chain under a 
constant force. (Page 218.) 

13-3. For any molecule with To = 3i$ = • • • — 0, show from Eq. (13-19) 
that 

Invert this series to get r/kT in powers of 7, then integrate dA « r <fi to find 


Qd,T) 

<2(0, T) 



(13-59) 


Equation (13-59) is exact only for M —*• », but Eq. (13-58) is exact in general. 
By comparing Eqs. (13-12) and (13-58), show that, for a freely jointed chain, 

(Page 219.) 

13-4. Deduce the expansion of JB -l (») from that of £(y) in Eq. (12-31). 
(Page 220.) 

13-5. Use thegaussian form for Q(t) (Eq. 13-29) to deduce A from Eq. (13-6) 
and 7 from Eq. (13-10). The result should agree with Eq. (13-28), of course. 
(Page 220.) 

13-6. Derive the length-force equation, (13-14), from the canonical ensemble 
equations (13-4), (13-5), and (13-23). (Page 220.) 

13-7. Deduce the probability expansion (13-22) from Eq. (13-27). (Page 221.) 
13-8. Deduce the entropy equation (13-30) from Eqs. (1-33) and (13-29). 
(Page 222.) 

13-9. Derive Eqs. (13-34) through (13-36) for a one-dimensional freely 
jointed chain. (Page 223.) 

13-10. Obtain the equivalent of Eqs. (13-60) in Problem 13-3 for the one¬ 
dimensional freely jointed chain. 

13-11. Discuss the problem of a two-dimensional freely jointed chain. 

13-12. Derive Eq. (13-33) as a special case of Eq. (7-59). 

13-13. Discuss the problem of a three-dimensional freely jointed chain in 
which each unit can have two lengths, a a and ag, with partition functions j a (T) 
and jf(T) [in the notation of Eq. (13-11)]. Consider also the problem in which 
each unit can have any length between a «= 0 and a «= a* with equal prob¬ 
ability. Incidentally, in an equivalent chain of statistical units, a gaussian dis- 
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tribution in a for the length of a statistical unit would be an appropriate approxi¬ 
mation (in the text, we use a single length o'). 

13-14. Calculate 1% and 1$ from the gaussian function (13-23). Compare with 
Problem 13-3. 

13-15. Show the identity of Eqs. (13-34) and (13-35). 

Supplementary Reading 

Chandrasekhar, S., Revs. Mod. Phys. 15, 1 (1943). 

Flory, Chapters 10 and 11. 

James, H. M., and E. Guth, J. Chem. Phys. 11, 455 (1943); J. Polymer Sci. 4, 
153 (1949). 

Tbeloab, L. R. G., The Physics of Rubber Elasticity. Oxford: 1958. Chap¬ 
ters 1-6. 

Wilson, Chapter 14. 
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CHAPTER 14 


LATTICE STATISTICS 

In Part III we discuss systems of molecules that exert intermolecular 
forces on each other. First we consider, in the present chapter, molecules 
confined to sites in a lattice (the “Ising problem”), and then turn in later 
chapters to mathematically more complicated systems (imperfect gases, 
liquids, etc.) in which the molecules are not restricted in this way. A 
feature of particular interest in all this work is the possibility of phase 
transitions (e.g., gas-liquid) when the temperature is low enough. 

We have already considered, in Chapter 7 and in Sections 12-4 and 13-1, 
a number of problems involving lattice statistics. In all these cases, how¬ 
ever, the individual sites or subsystems were independent of each other. 
In this chapter we extend the discussion to more complicated problems in 
which interactions between nearest-neighbor sites or subsystems exist. 
Second-neighbor, and higher, interactions are important in some cases 
but, for simplicity, we shall confine ourselves to models with nearest- 
neighbor interactions only. 

Further applications of the results obtained in this chapter will be found 
in Chapters 16 (hole theory of liquids) and 20 (lattice solution theory). 

A more detailed and advanced review of this subject is available in 
S. M., Chapter 7. 

14-1 One-dimensional lattice gas (adsorption). We consider here the 
following model: we have a linear array of M equivalent sites (M —* oo); 
each site may be empty or occupied by one molecule; the partition func¬ 
tion of an isolated molecule on a site is q(T); and when two nearest-neighbor 
sites are both occupied by molecules, there is a potential energy of inter¬ 
action w between the molecules (in some cases to is a free energy, a function 
of T, and not a potential energy; also, the zero of energy for w is infinite 
separation). This is a realistic model for the adsorption of molecules or ions 
onto sites on a linear polymer molecule if the forces between the adsorbed 
molecules or ions are of sufficiently short range that only nearest-neighbor 
interactions need be taken into account. This might well be the case even 
for ions, in the presence of added electrolyte (see Chapter 18). The model 
is the same as that of Eqs. (7-3) through (7-16) except for the added 
complication here of the nearest-neighbor interaction energy w. 

This one-dimensional problem has a special added importance in statis¬ 
tical mechanics by virtue of the fact that it can be solved easily and exactly. 
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An exact treatment has been achieved with great difficulty for the corre¬ 
sponding two-dimensional problem only in the special case that half the 
sites are occupied; it has not been achieved at all for the three-dimensional 
problem (except for various series expansions). 

We shall use the canonical ensemble and the maximum-term method. 
A much more elegant matrix method is available (see S. M., pp. 312-314 
and 323-324), but we avoid this in deference to readers not sufficiently 
equipped mathematically. The system is characterized thermodynamically 
[see Eq. (7-6)] by M sites, of which N are occupied, and the temperature T. 
Equation (7-4) gives the canonical ensemble partition function Q(N, M, T) 
when w — 0. Our task is to generalize this Q for to 5 * 0. 

When the N molecules are distributed among the M sites in a particular 
configuration or arrangement with N lx nearest-neighbor pairs of sites 
both occupied, the interaction potential energy is AT n w. Actually, it 
proves convenient later to use here, instead of Nu, the variable N ox —the 
number of nearest-neighbor pairs of sites with one site empty (subscript 0) 
and one site filled (subscript 1). The relation between Nu and N 0 i can 
be established by the following argument. If we draw a line from each 
occupied site to its two neighboring sites, we will have drawn 2 N lines. 
Also, in this process, we will have placed two lines between each nearest- 
neighbor 11 pair (one starting from each side) and one line between each 
01 pair. Therefore, 

2N = 2N X1 + N 01 . (14-1) 


A similar argument for empty sites gives 


2(M — N) = 2N 0 o + Nov (14-2) 

We see from Eqs. (14-1) and (14-2) that only one of Nu, N 0 i, and N 00 
is independent; we choose Nqv End effects are neglected in Eqs. (14-1) 
and (14-2) because ilf —» 00 . 

A configuration or arrangement of N molecules on M sites with Noi 
pairs of type 01 will have an interaction potential energy 

Nuu> = (n- ^)w, 

according to Eq. (14-1). Suppose that there are altogether g(N, M, N 0 i) 
configurations with exactly N 0 i pairs of type 01. That is, suppose there 
are g(N, M, N 0X ) different ways in which N molecules can be distributed 
on M sites giving N 0 1 pairs of type 01. The contribution of these con¬ 
figurations to Q is g(N, M, Noi)e~ NllWlkT , and the complete expression 
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for Q is then 

Q(N, M,T) = q N J2 9W, M, N n ) exp [- (n - ^ «/w] 

= (qe~ alkT ) N X) 0 (N, M, N 0 i)(e wl2kT ) N,>l , (14-3) 

Noi 

where the sum is over all possible values of N 0 1 for given N and M. 

Having related Q formally to g, our next problem is to find an explicit 
expression for g. We might note at the outset that we must have for the 
total number of configurations with given N and M , 

g «*■*■»•«■>- m < 14 -» 

In view of this relation, it is clear that Eq. (14-3) reduces, as it should, to 
Eq. (7-4) when w = 0. 

Since we shall be using only the maximum term in the sum in Eq. 
(14-3), N, M, and N ol may all be regarded as very large numbers. For 
concreteness, suppose N Ql is odd and that the site on the left of the linear 
array of sites is of type 1. For example: 

llljoojljOjljOOjlljO 
N = 7, N 0 i = 7, M = 13, M — N — 6. 

Then there are (N 0 1 + l)/2 groups of l’s, (iV 0 i + l)/2 groups of 0’s, 
and the left-hand group is a 1 group, while the right-hand group is a 0 
group. These remarks follow from the fact that a 01 pair occurs at each 
boundary between a 1 group and a 0 group. Now consider the number of 
ways of arranging N l’s in (2V 0 1 + l)/2 groups. Each 1 group must have 
at least one site of type 1 in it; thus the required number of arrangements 
is the number of ways of assigning the remaining N — l(N 0 i + 1)/2] (=X) 
l’s among the (JV 0 1 + l)/2 (= Y) groups, with no restriction on the number 
of these l’s per group. This number is 

(X + Y — 1)1 AT! 

(Y - 1)LXT! “ (JV 0 i/2)![iV - (N 0 i/2)]! ’ 

where we have dropped unity compared with laige numbers in the second 
expression. The corresponding number for the 0’s is obtained by replacing 
N by M — N. Then g is twice the product of the above number for the 
l’s and the corresponding number for the 0’s. The factor of two arises 
because the left group could as well be a 0 as a 1. But this factor is neg* 
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ligible, for we use only In g and not g itself. Thus we have finally 

u v , _ Nl(M - N)\ _ 

oi) ~ [N _ (i\T 01 /2)]Pf — N — (iV 0 i/2)]!l(Ar 0 i/2)!P' 

(14-5) 

This formula for g is now inserted in Eq. (14-3) for Q. But the sum is 
difficult, so we use the maximum-term method (Appendix II). Let 
tiffou N,M,T) = g(e wl2kT ) y ° l in Eq. (14-3). Then from the condition 


d In < _ 3 In . to 

3N 0 i ~ dN 0 i + 2 kT 

and Eq. (14-5), we find 

(9 — a)(l — 9 — a) 

^ " 6 


with 




(14-6) 


(14-7) 


where is the value of N 0 1 giving the maximum term in the sum in 
Eq. (14-3). Equation (14r-7) is a quadratic equation in a, and gives N* t 
as a function of N, M, and T: 


_Nt i _ 29(1 - 9) 
a ~ 2M ~ 0 + 1 ’ 

where 

0 = [1 - 49(1 - 9)(1 - e~ alkT )] 112 . 


(14-8) 


The sign on the square root can be determined from the special case 
w/kT = 0. Before proceeding to find the thermodynamic functions of the 
system from Q, let us digress to note that Eq. (14-7) can be rewritten as 


NiiNto e~ alkT 
N*f ~ 4 


(14-9) 


if we use Eqs. (14-1) and (14-2). This has the form of a chemical equi¬ 
librium quotient, as in Eq. (10-5), for the “reaction” 


2(01) (11) + (00). (14-10) 


The “equilibrium constant” e -w/ * I ’/4 is consistent with the “partition 
functions” g 0 o = 5 n =^~ wlkT , and g 0 i = 20(configurational de¬ 
generacy). That is, e~" lkT /4 = 9iig 0 o/?oi- ° 
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The chemical potential follows from 


In Q = N In qe- a,kT + In S,, N, M, T) (14-11) 

and 



The last term drops out because of Eq. (14-6). Then 


_M_ _ 
kT ~ 



From this and Eq. (14-6) we find 


\qe 


,-wlkT 1 ~ 

e 


te)' 


(14-12) 


where X = e* lkT as usual. Finally, we eliminate a in Eq. (14-12) with 
the aid of Eq. (14-8), and obtain 


„ xae~ wlkT ~ ~ 1 + 26 
V Me “0 + 1-2 9 


(14-13) 


This equation gives X as a function of 0 and T. In the case of adsorption 
or binding, X is (at least approximately) proportional to the equilibrium 
gas pressure or solute concentration [see, for example, Eq. (7-9)]. Hence 
this is the "adsorption isotherm. ” Equation (14-13) reduces to the Lang¬ 
muir equation, (7-9), when w/kT —* 0. Figure 14-1 gives 9 plotted against 



Fio. 14-1. Exact adsorption isotherm for one-dimensional lattice with 
nearest-neighbor interactions. The w/kT =» 0 case is the Langmuir isotherm. 
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In y for w/kT = 0 (Langmuir) and —2.4. The symmetry evident in 
Fig. 14-1 can be expressed analytically (Problem 14-1) by 

y(e)y(l -0) = l. (14-14) 

Next, we find the equation of state. We could use $ = kT d In Q/dM 
(Eq. 7-7) but instead, with n available, we employ 

kT = ~ Mkf = Hfik + M ]nQ ‘ (14-15) 


From Eqs. (14-8) and (14-11), we find 


JL 

kT 


0\n J+Y=f e + In 9(N, M, N* 0l ) + 


20(1 - 0) 

0 + 1 


w 

kT' 


Finally, we use Eqs. (14r-5) and (14-8) to eliminate Njj from In g. Lengthy 
algebra leads to 


A 

kT 


= In 


0 + 1 


0 + 1 - 2 8 


(14—16) 


This reduces to Eq. (7-7) (Langmuir) when w/kT -* 0. 

For a first-order phase transition to appear in this system, we must 
find the familiar van der Waals type of loop in the equation of state, 
(14-16), below a critical temperature. From the symmetry of Fig. 14-1, 
it is clear that if a critical point exists, the critical value of 8 is 1/2. The 
usual equation-of-state p-v plot corresponds here to a &-1/6 plot 
(Problem 14-2). In a van der Waals loop, then, d$/d(l/8) will be positive 
at 8 — 1/2; or, d$/dd will be negative at 8 = 1/2. But we deduce from 
Eq. (14-16) that 

= 2e vl2kT . (14-17) 

\ 08 


This quantity can never be negative, so a loop and first-order phase transi¬ 
tion are not observed. If to is negative and T —* 0, the derivative ap¬ 
proaches zero. Therefore the “critical point” is at 8 e = 1/2, T e — 0. We 
shall see in Sections 14-3 and 14-4 that two- and three-dimensional lattice 
systems do lead to first-order phase transitions (that is, with T e > 0). 

Incidentally, it is easy to show from thermodynamics (Problem 14-3) 
that 
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and therefore, in Fig. 14-1, 


' d8 \ 
hi y/#- 1/2 


—v>l2kT 

c 

4 


This slope is steeper than the to = 0 slope if w < 0, but less steep if 
w > 0. If w < 0 and T —*• 0, the slope approaches -f-oo. 


14-2 Elasticity of a linear polymer chain. In the first part of Section 
7-4 we studied a linear chain of a and £ units, with lengths of units l a and 
If. We reconsider the same problem here, but now we allow interactions 
between nearest-neighbor units in the chain. The interaction energies 
(hydrogen bonds, for example) are, in obvious notation, w m , w a9 , and w 9 f. 
This model, and extensions of it, has been applied to the a-0 transformation 
in fibrous proteins, and to the helix-random coil transformation in protein 
and nucleic acid molecules in solution (force on chain = t = 0). We shall, 
however, not go into the details of these applications. 

Equation (7-57) is again the thermodynamic starting point. In place 
of the canonical ensemble partition function (7-58), we must use the 
equivalent of Eq. (14-3). We let g(M a , M, N at ) be the number of ways in 
which M a a units can be distributed among M positions on the chain so that 
there are nearest-neighbor pairs of type aff. This is obviously the same 
function as in Eq. (14-5), except for notation. In any one of these g 
configurations, the total interaction potential energy W is 

W = N aa W aa + N«t/Waf + NfftWffff 

= - + M a v> aa + (M - M a )Wffff, (14-18) 

where we have used Eqs. (14-1) and (14-2) and defined w by 

W = Wa* + Wffff — 2w«ff. (14-19) 

The energy w is the change in energy (or free energy if the w’s are functions 
of T) in the process (14-10). This composite w plays the same role as w 
in the preceding section. Therefore the problem is not complicated in a 
fundamental way by the occurrence here of three different w’s. Then, 
instead of Eq. (7-58), we have 


Q(M a , M,D = £ g{M„, M, N a ff)e~ wlkT 


= 0>- w « /tr ) Jf «0>' w » /tr )"“^ £ 9(Ma, M, Naff)(e'° l2kT ) ff < 

"of 


(14-20) 
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This is now in essentially the same form as Eq. (14-3). Equations (14-4) 
through (14-10) apply here without change (except notation). 

For the force on the chain we find, just as in Eq. (14-13), 


r(h i — l«) 
kT 


_(dj*Q\ _ J/ 0+1-2A 

~ \9M m ) M , T - “Lv - 1 + 25/ J (1 ^ 21) 


where 6 — MJM (fraction of a units) and 9 is replaced by 5 in the func¬ 
tion 0(0, T). Figure 14-1 is a plot of S (ordinate) against 


T(h ~ U 
kT 


+ ln 






jfie- w »l kT 


S In y. 


(14-22) 


A negative w means here that aa and 00 pairs of units attract each other 
more than do a0 pairs. This leads to a tendency for the system to split 
into two phases (predominantly a or predominantly 0), a tendency which, 
however, cannot be realized in a one-dimensional system, as pointed out 
in the previous section. In Fig. 14-1, as r —► co, In j/ —> —oo, and S —» 0 
(all long units). 

For free molecules (that is, r = 0), the equilibrium value of S is deter¬ 
mined by Eq. (14-21) with r == 0. This value of 5 is a function of T, a 
dependence which has been investigated experimentally for the a helix- 
random coil transition in proteins by Doty and collaborators. The corre¬ 
sponding equation when the id’s are all zero is (7-61). 


14-3 Two-dimensional square lattice. Here we use the lattice gas 
language again, since the only practical application is to adsorption on a 
surface. The model is the same as in Section 14-1, except that the sites 
form a two-dimensional square lattice instead of a one-dimensional lattice. 
In particular, q is the partition function of an adsorbed molecule on a site, 
and w denotes the interaction energy between two molecules on nearest- 
neighbor sites. Some of the exact properties of this system have been 
found, especially by Onsager. These are confined mostly to the special 
condition 0 = 1/2, but by symmetry [juBt as in Eq. (14-14)], it can be 
deduced that 0 e = 1/2 if a critical point exists, so this is the most inter¬ 
esting value of 0. Because of the advanced mathematical techniques 
needed for this problem, we shall confine ourselves to a brief summary, 
without proof, of some of the results obtained. References and more 
details will be found in S. M., Chapter 7. The same problem is treated by 
approximate methods in Sections 14-4 and 14-5. 

The first result we mention is that the two-dimensional lattice gas can 
exist in two phases, a dilute phase ("gas”) and a condensed phase ("liquid”), 
and below a critical temperature the two phases can be in equilibrium with 
each other. The transition from one phase to the other involves a latent 
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Fig. 14-2. Exact two-phase region for two-dimensional lattice gas with 
w < 0. Dotted curves are schematic isotherms. 


heat. In other words, the model predicts a first-order phase transition 
under suitable conditions. The critical density is 0 C = 1/2, and the 
critical temperature (first found by Kramers and Wannier) is given by 
x e = -\/2 — 1 = 0.4142, where x = e wl2kT and x e = e '° l2kTc . Thus to 
must be negative for a critical temperature to exist (Problem 14-4) and 
for a first-order phase transition to occur. A plot analogous to the usual 
p-v plot is given in Fig. 14-2 for to < 0. Inside the solid curve is the two- 
phase region. For a given temperature (i.e., value of x) below the critical 
temperature, the pressure at which the two phases are in equilibrium 
(i.e., the vapor pressure) is 


kT - In (1 + x 2 ) + 2t 




+ (1 


ft® sin® v) 1/2 ]} dip, (14-23) 


where 


*i 


4x(l - x®) 
(1 + x 2 ) 2 ’ 


Actually, Eq. (14-23) is more general than just indicated: # is the pressure 
for any value of x (w positive or negative) at 0 — 1/2 (e.g., any point on 
the dashed line in Fig. 14-2). Thus, if to = 0 (x = 1), $/fcT = In 2, in 
agreement with Eq. (7-7). 

Let $t be the value of 6, for given x, on the liquid side of the two-phase 
region (Fig. 14-2). Then for the same x, symmetry leads to 6g = 1 — 6l, 






244 


LATTICE STATISTICS 


[CHAP. 14 



\kT/-w 

Fig. 14-3. Exact configurational heat capacities for one-dimensional 
(curve I) and two-dimensional (curve 11) lattice gases. 


where Oq is the corresponding value of 0 on the gas side. The dependence 
of 0 a on x is given by 


e G (x) = \ 


1 f(l + s 2 )(l - 6x 2 + * 4 ) 1,2 1 1/4 

2 L (1 - * 2 ) 2 J 


(14-24) 


The contribution of nearest-neighbor interactions to the energy of the 
system is called the configurational energy, E eon n t = N* x w. We assume 
w is independent of T here. For 0 = 1/2, Onsager obtained explicit 
expressions, which we shall not write down, for 2? 00 nfig and the heat 
capacity C con fi g = dE con n g /dT, involving elliptic integrals. Figure 14-3 
shows C C onfig plotted against T. This is the configurational heat capacity 
along the dashed line in Fig. 14-2. As the temperature T e is crossed, a 
singularity in C CO niig occurs. It is easy to show (Problem 14-5) from 
Section 14-1 that in one dimension (6 = 1/2) 


m* ~ ( 4 kT Sech 4 kl) ' (14-25) 

This function is also plotted in Fig. 14-3; it does not have a singularity. 

Many exact series expansions have been found, not only for the square 
lattice, but for other two- and three-dimensional lattices as well. As an 
example (see also Problem 14-17), we derive the second virial coefficient 
B 2 (T) in the expansion of 4>/fc7’ in powers of 6 for an arbitrary lattice with 
c nearest-neighbor sites to a given site (for example, c = 4 for a square 
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lattice). The definition of the nth virial coefficient B„ is obvious from the 
expansion 

-pf = 6 + B 2 (T)6 2 + B 9 (T)8 3 + • • • • (14-26) 

We use the method suggested by Eq. (1-69), since it reduces the problem 
to a two-body problem. For an arbitrary lattice, Eqs. (14-1) through 
(14-3) become 

cN = 2N n + Not, (14-27) 

c(M - N) = 2N 0 o + N ou (14-28) 

Q(N, M,T) — (qe~ ew,2kT ) N £ gW, M, N 01 )x n °', (14-29) 

Noi 

where x = e vl2kT as before. The grand partition function is 

S = e* u ' kT = 1 + Q(l, M, T)\ + <2(2, M, T)\ 2 + • • •, (14-30) 

where X = e l ‘ lkT . When N = 1, the only possible value of N 01 is c, and 
1/(1, M, c) — M, since the one molecule can be on any of the M sites. Thus 

<2(1, M,D — qe~ cw/2kT Mx c = Mq. (14-31) 

When N = 2, which is as far as we have to go for B 2 (T), N 0 i can have the 
values 2c or 2c — 2. In the former case the two occupied sites are not 
nearest neighbors to each other, but in the latter case they are. Then 

g( 2, M, 2c) = M(M ~ 2 C ~ — > g(2, M, 2c - 2) = ^ • (14-32) 

The first result, for example, follows from the fact that the first molecule 
can be placed at any one of M sites and then the second can be placed at 
any one of M — (c + 1) sites to avoid “contact." Note that Eqs. 
(14-32) satisfy Eq. (14-4). Equations (14-29) and (14-32) give 

<2(2, M,T) = q 2 M ^ CX " 2 + ^ - c ~ l ). ( i4_33) 

The grand partition function, (14-30), becomes then 

S = e* ulkT = 1 + Ms + M (“ 2 + M ~9~ l \ ? 2 + ..., (14-34) 


where we define 


e as q\. 


(14-35) 
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If we take the logarithm of both sides of Eq. (14-34), we get 


4 > _ i 

kT~ z + 


(ex 2 — c — 1) 


2 2 + 


(14-36) 


Since Mz is of order M, the legitimacy of using the expansion of 
In (1 + Mz + • • •) here is questionable, to say the least. Actually, the 
result can be justified by another argument, which we give later in Sec¬ 
tion 16-1. Next, we use the thermodynamic equation 37 dp. = M d& 
(T constant), or 

to obtain from Eq. (14-36), 

0=2 + (cx~ 2 - c - l)s 2 H-. (14-38) 


Since, by definition, z is proportional to X, and 2 —► 0 as 0 —► 0, it is ap¬ 
propriate to call 2 an “activity” in the chemical thermodynamic sense. 
The inverse of Eq. (14-38) is 

2=»+(« + l- <a“ 2 )0 2 H-. (14-39) 


Finally, we use Eq. (14-39) to eliminate z from Eq. (14-36) and obtain 
$/kT in powers of 0, as in Eq. (14-26). We find 


Bz(T) = 


c + 1 — cx 2 
2 


c + 1 - ce~ wlkT 
2 


(14r-40) 


This agrees with Eq. (7-7) when w = 0 (x = 1). The second virial coeffi¬ 
cient can become negative if w/kT is sufficiently negative (B 2 —> —cx~ 2 / 2). 
On the other hand, if w/kT is large and positive, B 2 —* (c + l)/2 (two 
molecules cannot occupy nearest-neighbor sites). 

The method we have used here can be extended to B 8 , B 4 , etc., and has 
very general applicability in statistical mechanics.* We shall encounter 
it again several times in this book (Chapters 15 and 19). 


14-4 Bragg-Williams approximation. No two- or three-dimensional 
lattice statistics problem has as yet been given a treatment that is both 
complete and exact. Hence, approximate methods are useful. The present 
approximation is probably the simplest possible that retains the correct 
qualitative features. It is equivalent to the van der Waals approximation 
for imperfect gases and liquids (Chapter 16). 


* T. L. Hill, J. Chem. Phyt. 27, 561 (1957). 
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In the Bragg-Williams approximation for a lattice gas with nearest- 
neighbor energy w, the configurational degeneracy and average nearest- 
neighbor interaction energy are both handled on the basis of a random 
distribution of molecules among sites [i.e., as if w — 0, as in Eq. (7-4)]. 
This is obviously an incorrect procedure, except in the limit as T —* oo. 
Thus, instead of Eq. (7-4), we use 

M\a N e~** nV,kT 

Q(N, m,d= ’ < 14 ^> 

where Jtnw is the average interaction energy. We calculate 77n as 
follows: a molecule at a site has, on the average (random distribution), 
c6 = cN/M occupied nearest-neighbor sites next to it; therefore, 7Jn — 
(cN/M)(N/2) — cN 2 /2M, where the factor of two is inserted to avoid 
counting each 11 pair twice. The same partition function follows from 
Eq. (14-29) if we write 

E«<". »■ E»- ■ 


where TJo i = cN(M — N)/M (random distribution). 

The various thermodynamic properties may easily be derived (with 
to assumed constant) from Eq. (14-41). For example, 

InQ == - ^ = M In M - N\nN - (M — N) In (M - N) 

+ < 1M2 > 


| = ln« + T 

4 _ /dlnQ\ 
kT~\dMj 


/din Q 

) .. 

\ dT 

Jn.u 


cwd 2 

N,T 

2kT ~ 

~ 

,+ (; 


= In 


Ml 


N\(M - N)l 
- In (1 - e) 


+ 


N(la 


q+T 


+ 


JL 

kT 


(d\nQ\ 6e ve,,kT 

~ \dN (l-9)q’ 


d In q\ 
dT / '• 
(14r-43) 

(14-44) 

(14-45) 

(14-46) 


y m q\e~ ewl2kT 


j e «w(2»-l)/2*r 

i - e 


(14-47) 


The entropy is seen to be the same as for the ideal lattice gas (Eq. 7-11). 
Equation (14-44) is the equation of state, and Eq. (14-46) or (14-47) is 



Fig. 14-4. Pressure-volume isotherms for a Bragg-Williams lattice gas 
The curve labeled cw/kT — —4 is the critical curve. 


1.0 



Iny —»- 


Fig. 14-5. Adsorption isotherm for Bragg-Williams lattice gas. Thi 
cw/kT ■= 0 case is the Langmuir isotherm. 
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the adsorption isotherm [see Eq. (7-9)]. The second virial coefficient in 
Eq. (14-45) agrees with the exact relation (14r-40) in the limit as w/kT —> 0, 
as we should expect from the nature of the Bragg-Williams approximation. 
We have defined y above in such a way that the symmetry property 
(Eq. 14-14) y(6)y(\ — 0) = 1 holds (Problem 14-1). This definition of 
y is consistent with that in Eq. (14-13), where c = 2. 

Both Eqs. (14r-44) and (14-47) lead to loops (as in van der Waals 
equation) and to the prediction of a first-order phase transition (Figs. 14-4 
and 14-5). To draw in the horizontal stable equilibrium path in Fig. 14-4, 
we may use either the thermodynamic equal-area theorem or the symmetry 
condition 0 L = 1 — 0 a . By symmetry, the critical point must be at 
0 e = 1/2. We can locate the critical temperature from the condition 



We find cw/kTe = —4. The exact value for a square lattice (Section 14-3) 
is 4w/kT e = —7.05. The Bragg-Williams theory incorrectly predicts a 
phase transition for a linear lattice (c = 2). 

A number of phase transitions (vertical jumps) in adsorption isotherms, 
as in Fig. 14-5, have been observed experimentally. If we take (Problem 
14-4), say, w — —400 cal-mole -1 , then for a two-dimensional hexagonal 
lattice we have* T c = 77°K. At this temperature, however, the adsorbed 
phase is probably more nearly mobile than localized (Section 9-6) so a 
better model for the experimental systems is a two-dimensional gas-liquid 
transition (Sections 16-1 and 16-2). A system which is definitely localized, 
which shows a phase transition, and to which the Bragg-Williams theory 
has been applied successfully is the absorption of hydrogen gas (as atoms) 
by palladium metal.f The experimental critical temperature is T 0 = 
568°K. We might then roughly estimate w, using c = 0.6 X 12 = 7.2 
(0=1 corresponds to 0.6 H atom per Pd atom), from 

15.80Jfcr« w 6 , , _i 

w =-— X =-= = —2500 calmole . 

o l.i 

For consistency and convenience we are using throughout this chapter, 
except in Section 14-2, notation and language appropriate to a lattice gas. 
We could have used “magnetic language,” instead, as in Section 12-4. 
To illustrate this, we turn next to the Bragg-Williams approximation for a 
ferromagnet or antiferromagnet. 


* The exact value of &w/kT e lor a hexagonal lattice is —15.80. 
t See Fowler and Guggenheim, pp. 558-563, for details. 
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Section 12-4 should be reviewed for notation and thermodynamics. 
In a ferromagnet, the magnetic dipoles are associated with electron spin 
in atoms. Parallel spins or dipoles (tt or iJ.) at nearest-neighbor positions 
in the lattice are assumed here to have an interaction energy —J, while 
antiparallel spins (f 1) have an interaction energy +J. In Section 12-4, 
J = 0. These interaction energies are not simply dipole-dipole interac¬ 
tions, which are in fact negligibly weak by comparison, but are due instead 
to quantum-mechanical exchange forces, as in chemical bond theory. If J 
is positive, we have the ferromagnetic case, and if J is negative, we have 
the antiferromagnetic case. 

The partition function Q(M, T, H) of Eq. (12-40) has to be modified 
to take care of the nearest-neighbor interaction energies just referred to. 
For a given value of N (the number of 1 dipoles) in the sum in Eq. (12-40), 
we insert a factor e~ wlkT , where W is the total interaction energy: 


W = TfiiWu + "RoiWoi -J- tfooWool 

0=T, 1=1, Wn = Woo = —<f| w oi = 




cN 2 
2 M 


7?, 


01 


cN{M - N) 
~ M 


Foo = 


c(M - Ny 
2 M 


This gives 


W = — 


eJ(2N - M) a 
2 M 


(14-49) 


After putting the factor e wlkT t with the above W, behind the summation 
sign in Eq. (12-40), we can rearrange as follows: 


Q(M, T, H) 


| ?e (2mff+c/)/2fcTjAr ^ 


M 

E 

N-0 


j£] e 2cJN*IMkT[ e -2(mB+cJ)lkT^N 

N\(M - N)\ 

(14-50) 


Let us compare this with the grand partition function for the lattice gas, 
based on Eq. (14r-41): 


E= £Q(tf,M, T)\ n 

N 


^ M\e~ cN * wliUkT (q\) N 
Nl(M - AT)! 


(14r-51) 


The summation in Eq. (14-50) clearly has the same form as S in Eq. 
(14-51). Therefore, instead of rederiving results (Problem 14r-6) for the 
magnetic problem, we can merely transcribe those already found for the 
lattice gas. For example, since 

j, ZnNQ(N, 


(14-52) 
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the form of the sum in Eq. (14-51) suffices to determine "N/M as a func¬ 
tion of —cw/2kT and {X. This relationship has already been established 
in Eq. (14-47), using the canonical ensemble [which is the same as apply¬ 
ing the maximum term method to S in Eq. (14-51)]. Then, from the sum 
in Eq. (14-50) for the magnetic problem, 77/M must be the same function 
as above, but with 2cJ/kT and e ~ 2{mB+eJ)lkT as independent variables. 
Thus we have the correspondences 

- WT * W or W ** ~ 4J ’ (14-53) 

y = q\e~ cwl2kT *-» e ~ 2lmB+e />/H* e w/w or y c - 2 «H/tr 

(14-54) 

The relation (14r-53) also follows from Eq. (14-19). Therefore Fig. 14-5 
is also a plot, for the magnetic problem, of JJ/M — (1/2)[1 — (7/mAf)] 
(ordinate) against —2mH/kT (abscissa). 

The magnetic field strength is the analog of the chemical potential for 
a lattice gas, as pointed out at the end of Section 12-4. The related thermo¬ 
dynamic equations are 

d(<f>M) ss d(kT InS) = SdT + N dp + $dM (latticegas), 

-dA = d(kT In Q) = SdT + IdH - y.dM (magnet). 

When J > 0 (ferromagnetism), the critical (Curie) temperature is 
given by cJ/kT e = 1. Below the critical temperature, "spontaneous 
magnetization” can exist. This is the magnetization I, (Fig. 14-6) that a 


+1 



Fig. 14-6. Magnetization as a function of magnetic field tor a Bragg- 
Williams ferromagnet. The metastable parts of the curve lead to the possibility 
of hysteresis. This is the same curve as cw/kT =» —8 in Fig. 14-5. 
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ferromagnet (T < T c ) retains if a sample is placed in a magnetic field 
(H > 0) and then the field is removed (H —*• 0-{-): 

I, = lim 1(H). (14-55) 

H-.0+ 


In this state, even though H = 0, the majority of the spins are |, in the 
direction of the previously applied field. If T > T e , I, = 0. 

In connection with any first-order phase transition, a hysteresis loop is 
possible, as illustrated in Fig. 14-6. Hysteresis is well known in magnetic 
and adsorption systems, for example. 

As a final topic in this section, we show how the Bragg-Williams lattice 
gas equation (14-47) can be derived very simply from a one-site grand 
partition function £ [compare Eq. (7-31)]. We select one particular site 
and regard it as being in the potential field of molecules occupying other 
sites in the lattice at random. Let 0 be the fraction of other sites occupied. 
Then the average number of occupied sites nearest-neighbor to the partic¬ 
ular site is cd. When the particular site is itself occupied, the potential 
energy of the occupying molecule in the field of its neighbors is cOw. 
Therefore, instead of £ = 1 + ffX, as in Eq. (7-31), we have 

£ = 1 -f qe~ e>aliT \. (14-56) 


The average occupation of the site is 


3 = 



qe~ clv>lkT \ 

1 + qe— e,w l kT \ ’ 


(14-57) 


where 8 is considered a fixed parameter in the differentiation. Since the 
particular site selected is equivalent to any other site, we have to impose 
the further condition of consistency, 3 =8. With this substitution in 
Eq. (14-57), we again arrive at Eq. (14-47). 


14-5 Quasi-chemical approximation. In this section we describe an 
approximation that is significantly better than the Bragg-Williams ap¬ 
proximation but still not of unreasonable mathematical complexity. 
Further refinements of various sorts have been worked out, but we shall 
not pursue these here. 

The essence of the present approximation is that pairs of nearest- 
neighbor sites are treated as independent of each other, though we know 
they are not, since they overlap. For example, in a square lattice: 
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We start with Eq. (14-29) and use the assumed independence of pairs 
to approximate the function g(N, M, N 0i ). Each pair of sites can be 
occupied in four ways: 11,01,10, and 00. The total number of pairs of 
sites is cM/2. For given values of N, M, and N 0 i, the numbers of pairs of 
different types are [Eqs. (14-27) and (14-28)]: 

Number of 11 pairs = Nu = ^ • 

Number of 01 pairs = • 

Not 

Number of 10 pairs = —— • 

Number of 00 pairs = Noo — — 20. — ^p. 

If the pairs are independent of each other, each pair in the lattice can be 
assigned to one of the four categories above. The number of ways of 
doing this is 

w(N, M, Not) 

(cAf/2)! 

- mm - Miipf - N)/2] - (AToi/2))![(Noi/2)!] 3 

(14r-58) 

However, as it stands this cannot be set equal to g(N, M, No i), for it will 
rather obviously not satisfy the condition (14-4). This is because many 
configurations counted in « are actually impossible. For example, the 
two pairs of sites shown below cannot both be occupied in the manner 
indicated: 

U °l 



Thus « overcounts the number of configurations. To take care of this, 
we must normalize co: 

g(N, M, Not) = C(N, M)o>(N, M, N 0 1 ), (14-59) 

g»- we/- n - w m g 

To find C(N, M), we replace the co sum by its maximum term. From 
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d In u/dNoi = 0, we obtain 


Nh cN(M - N) 

2 “ 2M ’ 

and therefore 




(/!„), ]'"• OM0) 


Equations (14-58) through (14-60) provide our approximate expression 
for g(N, M, N 0 1 ). It should be noted that this argument accidentally 
gives the right answer for a one-dimensional lattice: put c = 2 in Eq. 
(14-59) and compare Eq. (14-5). Hence all the quasi-chemical relations 
below are exact in one dimension (c = 2). 

From this point we proceed exactly as in Section 14-1, so we omit 
details (Problem 14-7). Equations (14-7) through (14-10) are again 
found, except that now a — N^/cM. From 


InQ = Nlnqe~ ewl2kT + In l(Nt u N, M, T), 


we deduce, as in Eqs. (14-11) through (14-13), 


and 


.. = - 1 + 2fl(l ~ g) T /2 9 

y L (/s + i -t 2d)0 J i - e' 


e/3 


(14-61) 


This is the adsorption isotherm. The symmetry condition y(6)y(l — 0) = 1 
is easy to verify. The equation of state is 


JL = ]n (T Qg + i)(i — fl) T /a i | / 14 oon 

kT “ (L (fi + 1 - 29) J l-tf) (14-02) 

The qualitative behavior of Eqs. (14-61) and (14-62) is the same as in 
Figs. 14—5 and 14-4, respectively, for the Bragg-Williams theory. The 
critical point is at 6 e = 1/2 and a temperature determined from Eq. 
(14-62): 

(t)m„ - 0 = «<*-» + 2. 

or 

*.-.*'*' -Hr, (14-63) 

For c = 4 (square lattice), this gives 4w/kT c = —5.54, which is inter- 
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mediate, as expected, between the Bragg-Williams value (—4) and the 
exact value (—7.05). As a check on Eq. (14-63), if we put c = 2, we find 
T e = 0, as in Section 14-1. Incidentally, the Bragg-Williams result 
follows from Eq. (14-63) if we let c — > oo holding cw constant: cu>/kT c — * 
—4. Also, for example, the Bragg-Williams y(6) (Eq. 14-47) can be de¬ 
rived from the quasi-chemical y(6) (Eq. 14-61) in the same limit 
(Problem 14-8). 

The quasi-chemical approximation was first used by Bethe in connection 
with the order-disorder transition in alloys (see Chapter 20). However, 
the mathematical identity of his quite different physical approach with 
the above equations was not obvious at first. Guggenheim soon introduced 
another method, equivalent to Bethe’s and the combinatorial argument 
above, which was responsible for the choice of the term “quasi-chemical.” 
We shall merely sketch Guggenheim’s procedure, f The starting point is 
the intuitively reasonable quasi-chemical equilibrium relation, (14-9). 
[Incidentally, in the Bragg-Williams approximation, u> = 0 in Eq. (14-9).] 
From this and the conservation equations (14-27) and (14—28), we derive 
Eq. (14-8) with a = N%JcM. The configurational energy is then 

E,.m. - JVt.w - w 

From this we can obtain A 00 nfig by integrating the thermodynamic 
equation 

p _ /dAoonfig/T\ 

/Wig - ^ dl/T J N U 

between 1 /T — 0 (random distribution) and l/T. This integration is very 
tedious, incidentally. Then from Acniig we get $ and fi by differentiation 
with respect to M and N, respectively. 

14-6 First-order phase transitions. In this section we make some com¬ 
ments of a general nature on first-order phase transitions (i.e., transitions 
with a latent heat). The exact treatment of Onsager and the Bragg-Wil¬ 
liams and quasi-chemical approximations for a lattice gas provide us with 
examples. The familiar van der Waals equation of state for a fluid (Chap¬ 
ter 16) is another example. The Lennard-Jones and Devonshire (UD) 
theory of liquids (Chapter 16) is still another. 

The solid curve in Fig. 14-7(a) shows typical p-v experimental behavior 
below the critical temperature. The dashed portions represent meta- 


t Sec Fowler and Guggenheim, pp. 437-438, 441-443. 
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(a) (b) 

Fio. 14-7. (a) Typical experimental p-v isotherm behavior below the 
critical temperature, (b) Corresponding curve from approximate theory with 
restraint of homogeneity. 

stable states sometimes observed. If we start with dilute gas (p small) 
and then increase the pressure, when p = p 0 (and v = vg) is reached, 
another phase—liquid with v = v L —suddenly appears in the system. 
The sharpness of the appearance of a new phase is the feature of particular 
theoretical interest. In this respect, a phase transition is quite different 
from a chemical transition (i.e., chemical reaction), for in the chemical 
case the equilibrium composition varies in a smooth and continuous way 
with change in a variable (e.g., total pressure or temperature). The 
essential difference is that there is a cooperative aspect to a phase transition 
which is missing in a chemical reaction. Thus in a dilute gas reaction 
A B, the probability that any given molecule is in state A (or state B) 
is independent of the state of the other molecules in the gas. But, in a phase 
transition, if A = molecule in dilute phase and B = molecule in dense 
phase, there is a tendency for a large number of molecules to switch as a 
group from state A to state B because the molecules in state B can stabilize 
each other (hence the term “cooperative”) through intermolecular attrac¬ 
tions. Unlike the chemical reaction example, the tendency for the conver¬ 
sion of a particular A into a B is not independent of the state of other 
molecules; rather, the conversion is aided by the presence of other B mole¬ 
cules. Thus a phase transition resembles a landslide or autocatalytic 
process. 

An example of the distinction between a smooth (chemical) transition 
and a sudden (phase) transition can be seen in the magnetic system dis¬ 
cussed in Sections 12-4 and 14-4. In Section 12-4, there are no interactions 
between magnetic spins, i.e., the spins behave independently. We there¬ 
fore have a “chemical” type of equilibrium between states f and i. If 
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6 is the fraction of dipoles or spins in state j and 1 — 6 is the fraction in 
state f > then Eq. (12-46) can be written (in the notation of Chapter 10) 

P _ rr _ -AF°lkT _ -2mB/kT 

r=i- K ~ e ~ 

That is, K is the equilibrium constant and A F° = 2 mH is the energy 
change in the process: state f -* state j. If we vary the magnetic field 
or temperature, K will change smoothly and the transition from pre¬ 
dominantly one state (at equilibrium) to predominantly the other will 
take place smoothly. On the other hand, if we introduce a negative inter¬ 
action energy between nearest-neighbor spins in the same state [J > 0 in 
Eqs. (14-49) through (14-55)], the spins tend to switch states coopera¬ 
tively instead of independently and a sudden instead of smooth transition 
will take place at H = 0 if H is varied holding T constant at T < T 0 
(Fig. 14-6). 

The occurrence of a sharp phase-transition point can also be understood 
from Fig. 14-8. Curve G shows the typical dependence of the chemical 
potential of a dilute gas on pressure at constant temperature (Eq. 4-25). 
Curve L represents the chemical potential of the liquid. This curve is 
almost flat because of the relatively small volume per molecule in the 
liquid state; i.e., (dp/dp) r = v. The stable phase at any p is the one with 
lower fi. Because of the quite different slopes, the curves intersect at a 
sharply defined pressure p 0 (the vapor pressure at the temperature under 
consideration). Thus, if we start with p < p 0 , only the gas phase is 
present, for its chemical potential is lower. On increasing p, the liquid 
phase appears very suddenly at p = p 0 , the crossing point of the p-p 
curves. Both phases are present at p = p 0 , and only liquid is present when 
p > po- The dashed curves again correspond to metastable states. 



Fia. 14-8. Chemical potential as a function of pressure (T constant), 
showing liquid-gas transition. 


258 


LATTICE STATISTICS 


[CHAP. 14 


Just as for a chemical equilibrium, we can interpret in a general way 
the relative stability of the gas (dilute) or liquid (dense) phase by ob¬ 
serving that the gas phase is favored by a relatively high entropy and that 
the liquid is favored by a relatively low (intermolecular potential) energy. 
The dense phase is always more stable at low temperature or high pressure, 
and the dilute phase is more stable at high temperature or low pressure. 
At the transition point (p = Po, Mo = ml), the energy and entropy 
effects just balance each other. 

From the Bragg-Williams, quasi-chemical, van der Waals, or UD 
(Chapter 16) theories, we obtain a continuous loop, as in Fig. 14-7(b), 
instead of a three-branched p-v curve such as in Fig. 14-7(a). Why this 
discrepancy? A loop is always encountered in a theory which (a) uses the 
partition function Q and (b) introduces implicitly or explicitly the restraint 
of uniform density through the extent of the system. It is not possible 
under this restraint for two phases of different density to exist together in 
the system, as required at p = p 0 in Fig. 14-7(a). The loop in Fig. 14-7(b) 
is the p-v curve of an artificial system forced to be homogeneous (one den¬ 
sity) under all conditions. A three-branched curve as in Fig. 14-7(a) will 
result if Q is evaluated exactly (all configurations being represented, 
including those in which the density is different in different regions) or 
even if Q is evaluated approximately but all possible densities are allowed 
in any small region of the system. It should be added that mathematically 
sharp comers occur in the curve only in the limit of an infinitely large 
system. If the system is large but finite, the corners will be rounded off 
somewhat. 

It is easy to see that the partition functions E and A can never lead to a 
loop, regardless of the nature of the approximation used in Q. This follows 
from the fluctuation equations (2-10) and (2-14): (dp/dv)? cannot be 
positive. 

A much more detailed discussion of first-order transitions will be found 
in S. M., Section 28 and Appendix 9. 

So-called higher-order phase transitions in which there is no latent heat 
are also possible. A theoretical example is the two-dimensional lattice gas 
of Section 14-3, restricted to 6 = 1/2. If the temperature of this system is 
varied, a transition occurs at T = T c : there is a singularity in the heat 
capacity (Fig. 14-3) though the energy is continuous. The same kind of 
heat-capacity curve is found experimentally and/or theoretically in, for 
example, a ferromagnet or antiferromagnet at H = 0, a binary solution 
with composition held fixed at the critical composition (Chapter 20), and 
the order-disorder transition in alloys (Chapter 20). 

For a review of the whole subject of phase transitions, the reader is 
referred to Temperley (Supplementary Reading list). 
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Problems 

14-1. Prove the symmetry property y{6)y{\ — 6) = 1 for Eqs. (14-13) and 
(14-47). (Pages 240 and 249.) 

14-2. Calculate a few points and draw approximate curves for $/kT against 
0 and 1/6 for an ideal lattice gas, Eq. (7-7). (Page 240.) 

14-3. Derive the thermodynamic equation 

(dn/kT\ 1 (di/kT\ 

\ d9 ) T 6\ 96 )t 

Note the connection with Problem 7-13. (Page 240.) 

14-4. Calculate T c for a two-dimensional (square) lattice gas, using x c = 
0.4142 — € M/2tr « and w — —400 cal-mole -1 . (Pages 243 and 249.) 

14-5. Derive Eq. (14-25) for the heat capacity of a one-dimensional lattice 
gas with 6 — 1/2. (Page 244.) 

14-6. Use Eqs. (12-38) and (14-50) to derive the relation between I and H 
for a Bragg-Williams ferromagnet. (Page 250.) 

14-7. Fill in the details in the derivation of the quasi-chemical equations, 
(14-60) through (14-63). (Page 254.) 

14-8. Derive the Bragg-Williams y(6) (Eq. 14-47) from the quasi-chemical 
y(6) (Eq. 14-61), using the limit c —*• «° (cto constant). (Page 255.) 

14-9. Apply the (quasi-chemical) method of Section 14-5 to a one-dimensional 
lattice gas with a first-neighbor interaction energy «>i and a second-neighbor 
interaction energy u> 2 . Instead of assuming independent pairs of sites, assume 
independent triplets. 

14-10. Introduce second-neighbor, and higher, interactions into the Bragg- 
Williams lattice gas treatment. 

14—11. Derive equations for 5 and E for a Bragg-Williams lattice gas, con¬ 
sidering to a function of temperature, to(T). 

14-12. When H 2 gas at pressure p is absorbed by Pd metal, the absorbed 
hydrogen is in the form of atoms. Use the equilibrium condition jbh, (gas) = 
2/r H (absorbed) to deduce the absorption isotherm p{6) using the Bragg-Williams 
approximation. 

14-13. Expand the one-dimensional $/kT in powers of 6 in Eq. (14-16) and 
verify Eq. (14-40) for B 2 when c = 2. 

14-14. Define P(2V) = Q(N, M, T)\ N for a Bragg-Williams or quasi-chemical 
lattice gas. P(N) is proportional to the probability that an open system 
(X, M, T given) will contain N molecules. Discuss the form of the function 
P(N) when to < 0, T < T, and: (a) X = q~ 1 e CKl2tT (i.e., In y = 0 in Fig. 
14-5); (b) X different from the value in (a) but still within the extent of the 
loop (Fig. 14-5); and (c) X outside of the extent of the loop. 

14-15. Apply the argument of Eqs. (14-56) and (14-57) to a pair of nearest- 
neighbor sites instead of to a single site. That is, treat the pair exactly 
[see Eq. (7-32)), and assume all other sites in the lattice are occupied at random. 
Show that, in this approximation, 

2fe ( ‘- 1)(2, - 1) 

m “ (** + 1 - B 2 ) 1 ' 2 + (1 - 2 6)x ' 
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Is this result exact in one dimension? Does it satisfy y(6)y( 1 — 0) = 1? Does 
it reduce to the Bragg-Williams approximation when w/kT —► 0? 

14-16. Consider the partition function for a lattice gas, 


W,*,T) = Q(N, M, T)*? 


M-N 

-QW 


, n, T)/[ 


. ,Q(N,N+1,T ), 

1+ " , + "T 


where ij = e~* /kT . This expansion is valid at high pressures where there are 
very few empty sites (6 —» 1). Use the same general procedure as in Eqs. (14-30) 
through (14-38) to obtain an exact expansion of (1 — ff)/9 in powers of n 
through the i} 2 term. 

14-17. Obtain the Bragg-Williams In Q, Eq. (14-42), by starting with the 
exact Eq. (14-29) and making use of the high-temperature expansion * «* 1 + 
(u>/2 kT) + • • •. (This method, due to Kirkwood, can be extended to higher 
terms; see Eqs. (20-30) through (20-32).] 
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CHAPTER 15 
IMPERFECT GASES 

This chapter is concerned with gases which are dilute, but not so dilute 
that we can ignore intermolecular forces altogether. The first two sec¬ 
tions, in which our main object is to relate thermodynamic virial coeffi¬ 
cients to intermolecular forces, are the basic ones. The rest of the chapter 
is devoted to a few special topics. A much more complete treatment of 
this subject will be found in S. M., Chapter 5. 

The recurring theme throughout this chapter is that the application of 
the grand partition function to a dilute system such as an imperfect gas 
makes possible the reduction of a many-body problem in statistical me¬ 
chanics to one-body, two-body, etc., problems. 


15-1 Virial expansion for a one-component gas. The equation of state 
of a sufficiently dilute gas is p = pkT. This is a universal law, the same 
for all gases. At higher concentrations the equation of state (experimental 
or theoretical) can be put in the form 

jgi = P + B 2 (T)p 2 + B 3 (Dp 3 + ■•; (15-1) 

known as a virial expansion. The B n (T) are called virial coefficients. 
These coefficients are in general different for different gases and depend in 
particular on intermolecular forces. Roughly speaking, when the gas is 
dense enough so that pairs of molecules spend appreciable amounts of time 
near each other, the term in B a must be introduced. Interactions between 
three molecules involve B a , etc. The series (15-1) converges, when T < T c , 
for values of p up to p = 1/vq (Fig. 14-7), at which point liquid begins to 
appear in the system and there is obviously a singularity in the function 
p(p). When T > T c , the series converges for all p. 

From the Taylor expansion 


JL = 

pkT 


1 + 


( dp/pkT' 
\ dp 


1 ,+ 

/r.p-o 


1 ( d 2 p/pkT\ , 
2! \ ap2 /T.p-o P 


+ 


(15-2) 


we have the thermodynamic relations 


B n (T) = 


1 (d n ~ l p/pkT\ 

(n — 1)! \ dp»-i Jt.p -0 


(»= 2,3,...). (15-3) 
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This emphasizes the fact that the virial coefficients are properties of the 
gas in the limit of zero density, p = 0. This feature of the B n will be 
confirmed by our statistical-mechanical expressions below. 

First we give a quite general statistical-mechanical argument which is 
valid for any one-component gas that possesses a virial expansion [not 
all gases do, incidentally; e.g., a plasma of ionized hydrogen atoms (see 
Chapter 18)]. The results will be applicable, for example, to polyatomic 
gases, degenerate (quantum) gases (Chapter 22), etc. After deriving gen¬ 
eral relations, we shall turn specifically and in more detail to a classical 
monatomic gas (Section 15-2). 

The grand partition function is most convenient here, for reasons al¬ 
ready explained [see Eqs. (1-69) and (14-26)]. We have 

S(X, V, T) = e pVlkT = £ Qn(V, T)\ n = 1+ Y, GatX w , (15-4) 
Ateo Ate i 

where 

Q N (V,T)mQ(N,V,T), X = e>' kT . 


In Eq. (15-4), we have put Qo = 1 since, when N — 0, the system has 
only one state, and this with energy E = 0 [see Eq. (1-29)]. Equation 
(15-4) is a power series in X, the absolute activity. It proves very con¬ 
venient later if we define a new activity, z, proportional to X and having 
the property that z —* p as p — »0 [as in Eq. (14-35)]. To find the desired 
connection between z and X, we use the limit X —» 0 in Eq. (15-4): 

In S = g = Q,X + - •, 

Then clearly z = Q{K/V, since p = TJ/V. The partition function Q t 
(one molecule in V) is the same quantity as q in Eq. (3-10). If we put 
zV/Qi in place of X in Eq. (15-4), the result is 



To simplify notation, we define Zn(V, T) by the relation 

Zn{V,T) Qn{V, T)V n 
N\ Qi(V,T)" ' 


(15-5) 


In particular, Z\ = V. In classical statistical mechanics, Zn turns out. 
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with this definition, to be the configuration integral [see, for example, 
Eq. (6-22)]. We shall verify this statement in the next section. Then, 
finally, 

S(A, V, T) = e pVlkT = 1+ £ A (15-6) 

jv>i ni 


Now we take the logarithm of both sides of Eq. (15-6), expand the 
logarithm on the right, divide by V, and obtain an expansion for p in 
powers of z: 

j§5 - £ &, W, (15-7) 

where 


1! V6i — Zi = V, bi = 1, 


2!76 2 = Z 2 - Zj, (15-8) 

3!Vhj — Z 3 — SZ 1 Z 2 4- 2Zj, 


etc. A general relation is available* relating the 6, to the Zs- To convert 
the z expansion, (15-7), into a p expansion, as in Eq. (15-1), we use the 
equation 




(15-9) 


This gives 

P = £jM7V. (15-10) 

ii . 1 


If we invert this series to get 2 as a power series in p, we can substitute 
2 (p) into Eq. (15-7) and obtain the required virial expansion. To invert 
the series, we use for simplicity a straightforward algebraic method, 
though more elegant procedures are available for this purpose. We sub¬ 
stitute 

2 = p + o 2 p 2 + agp 3 + • • • (15-11) 


in Eq. (15-10) to obtain an identity in p. Equating coefficients of like 
powers of p on the two sides of the equation, we find 


a 2 = —26 2 , 

<13 = —363 — 4a 2 b 2 — —363 8 b 2 , 


* See, for example, S. M., Eq. (23.44). 
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etc. As a final step, we then put Eq. (15-11) (with the a’s just deduced) 
in Eq. (15-7) and get 

= P + Bs(r)p 2 + Bs(DP 3 + * • •, (15-12) 

where 

B 2 = - 6 2 , B a = 4ba - 26 3 , etc. (15-13) 

In retrospect, we see, above, that we have found the virial coefficients 
B n in terms of the by, the by in terms of the Zn, and the Zs in terms of the 
Q n . We note further that to calculate B 2 , we need only Qi and Q 2 ; to cal¬ 
culate B s , we need Q u Q 2 , and Q 2 ; etc. Now Qi(V, T) is the partition func¬ 
tion for a angle molecule in a box of volume 7; Q 2 (V, T) is the partition 
function for two molecules in 7 ; etc. Thus even though the actual gas 
has, say, 10 2 ° molecules in it, we have reduced the calculation of B 2 to 
one- and two-molecule problems in quantum mechanics (i.e., we need the 
energy levels for one and two molecules in 7 to compute Qi and Q 2 ), etc. 
In view of the comments at the beginning of this section, this result is 
intuitively reasonable. Thus we expect that binary interactions become 
significant when the B 2 term is needed in the equation of state, (15-12), 
and this is confirmed by our finding that B 2 depends on the properties 
of at most two molecules in the volume 7. That no additional molecules 
are in 7 in our computation of Qi and Q 2 (for B 2 ) corresponds to the fact, 
mentioned in connection with Eq. (15-3), that the virial coefficients are 
properties of the gas at zero macroscopic density. That is, B 2 depends on 
binary interactions in a vacuum, B 2 on ternary interactions in a vacuum, 
etc. 

At this point we digress to examine one step in the above argument 
which happens to lead to correct results but which needs justification. 
This is the step (15-6) —» (15-7). [We employed the same procedure 
in Eq. (14-36) and in finding the connection between z and A following 
Eq. (15-4).] The difficulty is that the expansion In (1 + x) = x — 
(l/2)a: 2 + • • •, which we have made use of, is valid only if x 2 < 1 ; 
but in 1 + Vz + • • • in Eq. (15-6), 7z is of order 77. The following argu¬ 
ment avoids this complication. We start with the desired form of expan¬ 
sion, (15-7), and work backward to find the connection between the by 
and the Zn. That is, at the outset, the by in (15-7) are undetermined 
coefficients. From the expansion (15-7), we form the function 

e y ' kT = exp (7 2 bysA = II e™** 

\ y>i ) i > 1 


(15-14) 
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This is a power series in z. The coefficient of z N in the series is 



(15-15) 


where the sum is over all sets m = mi, ..., m*r satisfying the condition 

2 jmj = N. 

i-l 

On equating coefficients of z N in Eqs. (15-6) and (15-14), we have, then, 
that Zn/N\ is equal to the expression (15-15). The first few relations are 


Z x = Vbx = V, 

iZ 3 = Vb 2 + 4(7b,) 2 , (15-16) 

\Z 3 = Vb 3 + (F6,)(76 2 ) + UVh) 3 . 

These are easily seen to be equivalent to Eqs. (15-8), which give the b, 
explicitly in terms of the Z N . 

We return now to (15-12) and related equations and derive a few more 
general expressions. The fugacity / of an imperfect gas is, like z, propor¬ 
tional to X, but with a proportionality constant such that / —► p as p -* 0. 
That is, / —* pkT as p —» 0; hence the connection between / and z 
is / = zkT. Thus Eq. (15-11) gives f/kT as a power series in p. The in¬ 
verse of Eq. (15-7) would give f/kT as a power series in p/kT (Problem 
15-1). Also, if we define an activity coefficient 7 by the relation z = 7 p, 
then, from Eq. (15-11), 

7 = - = 1 + a,2P + asp 2 -j- —. (15-17) 

An alternative and more elegant expression for 7 can be found as follows. 
From z = Q{k/V and z = 7 p, we have 

- In (~) + In p + In 7(p, T), (15-18) 

where V/Q\ is a function of T only, since is proportional to V (see 
Problem 3-1, where g has the same meaning as Qi here). If we now inte¬ 
grate the thermodynamic equation 

d (wr) = vd fe) = l (c^r) T dp (r constant) ’ 
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using the virial expansion (15-12) for p/kT, we find 

^ = constant + lnp + ^ ^ ^ B k +ip k . (15-19) 

From the definition of z and the limit p —> 0, we see that the integration 
constant is In (V/Qi). Therefore, from Eqs. (15-18) and (15-19), 

In 7= — X>(2V, (15-20) 

*>1 

where we have defined ft by 

ft - - (^ir) (i5- 2 D 

Incidentally, it should be noted that Eqs. (15-7), (15-10), (15-12), and 
(15-20) are all thermodynamic expansions that can be interrelated by 
purely thermodynamic operations. Hence the connections obtained above 
between the bj, B„, and ft are essentially thermodynamic in origin. 
Molecular theory enters when we relate any of these coefficients to the 
Zn or Qn, as in Eqs. (15-8). 

A comment on the orders of magnitude in Eqs. (15-8) may be helpful. 
From Eq. (15-7), we have that 



Now let us write the equation for bj in (15-8) in the form 


j'.Vbj _ JJbj_ _ Zj - 

Vi ~ Vi -1 ~ Vi 

From the definition (15-5), we see that Zj = 0(V } ). Therefore each 
separate term (only one is shown) on the right of the above equation is of 
order unity, but the complete right side is of order 1 /N 3-1 , since this is 
the order of the left side. Thus the bj in Eq. (15-8) represent small terms 
left over after cancellation of the major contributions to the right-hand 
side. 

15-2 One-component classical monatomic gas. The special case we 
discuss in this section is a one-component classical monatomic gas with 
an intermolecular potential energy assumed pairwise additive. The 
reader should review Eqs. (6-15) through (6-22) and Appendix IV, where 
basic equations and criteria for the use of classical statistics are developed. 



15 - 2 ] 


ONE-COMPONENT CLASSICAL MONATOMIC GAS 


267 


We observe first that for this system Q, = q = V/A. 3 , according to 
Eq. (4-10). Then, in Eq. (15-5), QnV n /Q, n = Q N A 3N . Thus, on com¬ 
paring Eqs. (15-5) and (6-21), we see that the two Ztfs have the same 
relation to Qn- Hence, in this special case, Zn of Section 15-1 is just the 
configuration integral defined by Eq. (6-22). The first three configuration 
integrals are 

Zi = fdr, = V, 
v 

Z 2 = Jje~ ulr ' t)lkT dr, dr 2 , (15-22) 

v 

Z z = Jll e-'“ <r, ‘>+“ (r, »>+“<’-«)i /tr dtt dx 2 dr 3 , 

v 

where di = dx dy dz, r 12 = |r 2 — ri(, and w(r) is the intermolecular pair 
potential (Appendix IV) which, in principle at least, can be calculated 
from quantum mechanics. Beginning with Z 3 , the assumption of pairwise 
additivity in the potential energy Z7(r 1( r 2 , r 3 ) appears. That is, 

U — u(r J2 ) + u(ri 3 ) + «(r 23 ). 


From Eqs. (15-8) and (15-13), the second virial coefficient of a classical 
monatomic gas is 

B 2 (T) = ~& 2 = ~ 2V (Zz " Z?) 

= “ 2T //[e _u(ri>)/ir - 1] dn. (15-23) 
r 

Since «(r 12 ) goes rapidly to zero for intermolecular distances ri 2 greater 
than, say, 15 or 20 A, the integrand in Eq. (15-23) is nonzero only when 
the elements of volume dti and dr 2 are close to each other. For this reason 
we change variables from ri and r 2 to rj and r J2 = r 2 — ri (position of 
molecule 2 relative to position of 1 as origin). Integration over ri 2 leads 
to a result that is independent of the location of dii, except when drj 
is in a region of negligible extent {V is macroscopic) within a distance of 
order 20 A from the walls. Then 

BziT) = - ± f dr, f [e~^ lkT - l]dr l2 

V V 
r*> 

= - i Jo [e~ u(r)/kT - l]4irr 2 dr, (15-24) 

where we have put dr 12 = 4irr 2 dr, and the upper limit r — oo can be 
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used, since the only contributions to the integral come in the first 20 A 
or so. 

For a (hypothetical) gas of hard spheres, 


and 


u(r) = +«> r < o 

= 0 r > a, 


(15-25) 



(15-26) 


where a is the distance of closest approach of the centers of two spheres, 
or the diameter of one sphere. Thus B 2 is four times the volume of a 
sphere. 

If we use the much more realistic Lennard-Jones potential, (IV-1), for 
u(r), the second virial coefficient becomes 

B 2 (T) = —2rr* 8 jf [exp (~^ y~* - ^ y~ 13 j - l] y 2 dy, (15-27) 


where y = r/r*. This result predicts that for all molecules with an inter- 
molecular potential of the Lennard-Jones form, B 2 /r* 3 is a universal 
function of kT/e. This is another example of a law of corresponding states. 
Thus, if experimental values of B 2 as a function of T for a given gas 
are plotted by adjusting both horizontal and vertical scales (thereby de¬ 
termining values of e and r* as given in Table IV-1), then the experimental 
curve can be made to coincide with the theoretical curve of B 2 /r* 3 against 
kT/e calculated from Eq. (15-27). This is illustrated in Fig. 15-1 for A, 
Ne, N 2 , and CH 4 . (The two latter gases behave effectively as mona¬ 
tomic gases as far as the equation of state is concerned.) 

The Boyle temperature Tb is the temperature at which B 2 = 0. The 
theoretical value, from Eq. (15-27), proves to be kTa/e = 3.42. At 
temperatures below Tb, B 2 is negative owing to the predominant effect 
of the potential well in u(r) on the integral for B 2 . At sufficiently high 
temperatures, on the other hand, the potential well gets “washed out” 
(e/kT —* 0) and the short-range repulsion dominates. Hence, B 2 is posi¬ 
tive, as for hard spheres. At very high temperatures, the effective “hard- 
sphere diameter” decreases with increasing T, because the molecules can 
approach each other more closely when they are more energetic. Therefore 
B 2 (T) passes through a maximum . 

Turning now to £ 3 , Eq. (15-13) gives B 2 in terms of b 2 and b 2 . Define 
Xij = e~ uir *i >lkT , and recall that 
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Fig. 15-1. Reduced second virial coefficient (B 2 /V* 3 ) as a function of reduced 
temperature ( kT/t ). Curve C is the classical curve calculated from Eq. (15-27). 
The experimental points on curve <7 are a mixture of points for A, Ne, N 2 , and 
CH 4 . Curves A and B are the calculated quantum curves (Section 22-5) for He 
and H 2 , respectively, with corresponding experimental points also shown. 


Now we notice that 

JJJ (*12 — l)(*is - 1 ) dr 2 * 3 

V 

= v[J(x 12 - l)dr 12 J[J (*13 - 1)*1 3 ] 


= 4 blV. 


(15-28) 

This gives us an expression for 6§. For b 3 , we have from Eq. (15-8), 
6Vb 3 = fff fa 2*13*33 — *12 — *13 — Z23 + 2 ) tfr 2 dr 3 . (15-29) 


For symmetry, we combine Eqs. (15-28) (using the first line) and (15-29) 
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in the form 

ZB 3 V = 12517 - 663 7 

= JJJ [(#12 — 1 )(- Tl 3 — 1 ) + (*12 _ 1)(®23 — 1 ) + (*13 _ 1)(^23 _ 1 ) 

V 

— (* 12 ^ 13^23 — 3?|2 — *13 — X 2 3 + 2 )] *1 dt 3 dx 3 . 


After cancellation, this gives 

B 3 (D - ^vfff (Xl2 ~ l){Xi3 ~ 1)( * 23 “ 1} dtl dt2 *»• (15_30) 

v 

A fairly long calculation leads, for hard spheres, to B 3 = (5/8)fij, where 
B 2 is given by Eq. (15-26). 

For the Lennard-Jones potential, we first change variables to r t , r I2 , 
t\ 3 and integrate over r 1( giving a factor 7. Then, as in Eq. (15-27), 



kT/t 

Fio. 15-2. B 3 /r* 6 as a function of kT/t, calculated from the classical equa¬ 
tion (15-31). The experimental points are a mixture of points for A, N 2 , and CH 4 . 
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This gives B 3 /r* 6 as a universal function of kT/e. Figure 15-2 shows this 
function, calculated numerically from Eq. (15-31) (after rearrangement 
for computational convenience). The experimental values of B s in the 
figure have been “reduced,” using value of e and r* obtained from the 
second virial coefficient (i.e., by fitting the theoretical curve in Fig. 15-1). 
The agreement with experiment is only fair. Possible contributions to the 
discrepancy are: (1) error in the experimental values of B 3 ; (2) lack of 
effective spherical symmetry in N 2 ] (3) approximate nature of the Len- 
nard-Jones potential; and (4) nonadditivity of the intermolecular poten¬ 
tial of three molecules. 

The dependence of B 2 /r* 3 and B 3 /r* 6 on kT/e only, for the Lennard- 
Jones potential, is in fact a general result for all virial coefficients [see 
Eq. (15-38) below] if pairwise additivity of the potential is assumed for 
B 3 , B 4 , .... In this case the complete equation of state (15-12) can be 
written, using v = 1/p, 



(15-32) 


It is clear that not only the Lennard-Jones potential but any two-param¬ 
eter potential which is of the form 


«(r) = (energy parameter) X »( d i stapce p arameter ) (««») 


and which is assumed pairwise additive, will lead to the law of corre¬ 
sponding states for imperfect gases, (15-32). 

Equations (15-32) and (15-33) imply that (subscript c refers to critical 
point) 





(15-34) 


PcVc _ „ Tb _ , 

kT c ~ 4 ’ T c ~ C ®’ 


where C \,..., c 3 are dimensionless constants, the same for all gases obey¬ 
ing Eq. (15-33) with the same function h. Table 15-1 furnishes a test of 
all but the last of these relations for simple molecules, using experimental 
critical constants reduced by values of e and r* from the experimental 
second virial coefficient. The constancy of the “constants” is very good 
but not excellent. We can conclude that, to a rather good approximation, 
the intermolecular potential for effectively spherical molecules satisfies 
(15-33) and is pairwise additive. 
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Table 15-1 

Reduced Critical Constants 



V* 

r *3 

kT 

6 

p c r* 3 

€ 

PeV, 

kT e 

Ne 

2.35 

1.25 

0.157 

0.295 

A 

2.23 

1.26 

0.164 

0.290 

Xe 

2.05 

1.31 

0.187 

0.293 

N2 

2.09 

1.33 

0.185 

0.291 

0 2 

1.90 

1.31 

0.201 

0.292 

ch 4 

2.09 

1.29 

0.178 

0.288 

Average 

2.12 

1.29 

0.179 

0.292 


Equations (15-34) and Table 15-1 provide a method* for the estima¬ 
tion, from critical constants, of r* and e for other molecules that are of the 
same type but which are not in the table. That is, we can use the relations 



*3 v e *3 0.179c O.mkTc 

r* — > or r* =-=- - • 

2.12 p c p e 

There is a similarity in the form of Eqs. (15-23) and (15-30) for B 2 
and B 3 which can be shown to be general in the case of pairwise ad¬ 
ditivity. We give the result but not the proof. But first we digress to 
introduce “cluster diagrams.” Define fa = x,,- — 1. The function 
fa = e -* (r *H kT — 1 is nonzero only when r,-,- is small. The molecules 
1, 2 are said to form a “cluster” when / I2 occurs as the integrand in an 
integral over dii and dt 2 . The term is appropriate because the integrand 
is nonzero only when the two molecules are near each other. Equation 
(15-23) provides an example. Three molecules 1, 2, 3 form a cluster when 
any of the following integrands occur (nonzero only when all three mole¬ 
cules are near each other): / 12 / 13 / 23 , / 12 / 23 , /i 2 /is, f 13 / 23 - Equation 
(15-30) is an example. The integrands in Eqs. (15-23) and (15-30) can 
be represented schematically by “cluster diagrams”: 



(15-36) 


* T. L. Hill, J. Chem. Phya. 16, 399 (1948). 
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A line between two circles (molecules) t and j means that a factor /.,• is 
present in the integrand. The other cluster diagrams for three molecules 
are: 




(15-37) 


Now Eq. (15-21) relates Bk+i to 0*. The general result* referred to at 
the beginning of this paragraph is that 


0 * 


1 

k\V 



*+i dri drj ... dr*+i, 


(15-38) 


where S' 1,2 .* +1 is the sum of all different products of fa that connect 

molecules 1,2,..., k + 1 in a “doubly connected” cluster diagram. In a 
doubly connected diagram, between each pair of molecules there are at 
least two entirely independent paths which do not cross at any circle. In 
addition, a cluster of two molecules is considered doubly connected, for 
classification purposes, and is sometimes written 0=0. Thus the 
diagrams (15-36) are doubly connected, but those in (15-37) are only 
singly connected. Hence S' 1,2 = fia and £' 1 . 2,8 — fufiafia- This is 
consistent with Eqs. (15-23) and (15—30). However, for 0 8 and B i} there 
are ten terms in <S'j .2.3.4 corresponding to the following ten doubly con¬ 
nected diagrams: 



(15-39) 


* See S. M., Chapter 5, for proof. 


274 


IMPERFECT GASES 


[CHAP. 15 


Usually fit is referred to as an “irreducible cluster integral,” and bj is 
called a “cluster integral.” Cluster diagrams were first introduced by 
Mayer, starting from the canonical ensemble.* 

15-3 Two-component imperfect gas. Imperfect gas theory can easily 
be extended to gas mixtures. We illustrate this possibility by considering 
a binary gas and retaining only those terms necessary to deduce the second 
virial coefficient. We start off with equations not restricted to any particu¬ 
lar kind of gas (classical, quantum, polyatomic, etc.). The grand partition 
function is 

S(Xi, X 2 , V, T) = e pvlkT = Y, QftiitiiV, T)^ 1 . (1JH10) 

N i,Vj>0 


We define z,, z 2 , and Zn x n 2 by the equations 




QioXi 



(15-41) 


Z Nl w, _ Qv.v.F* 1 ** 2 

NilNal w 


(15-42) 


The coefficients Qio/V and Qoi/V in Eqs. (15-41) are functions of T only. 
The definitions (15-41) and (15-42) will prove below to have the desired 
properties that (a) z, —> p, and z 2 —* p 2 when pi and p 2 —* 0, and 
(b) becomes the configuration integral in classical statistical me¬ 

chanics. Equation (15-40) can now be rewritten as 


S = e» V!kT = 


s 


Wi,iVj>0 


Zn\N% _N 1 Ni 

Wi!W 2 ! ** * 2 


— 1 + Vzi + VZ2 +■ ^20*1 + Z i lZ,Z 2 + \Zo2,Z% + • • •. (15-43) 


On taking logarithms and expanding [see Eqs. (15-14) through (15-16)], 


^; = + z 2 + 6 2 o(T)2? + bu(T)ziZ2 + 6o2(?>2 + • • •> (15-44) 


where 

6 10 = 6 0 , = 1, 2!0!F6 20 = Z 20 - V 2 , 

1!1!F6„ = Zi i - V 2 , 0!2!F6 02 = Z 02 - V 2 . 


(15-45) 


* See Mayer and Mayer, Chapter 13. 
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From 



we have 

Pi = *1 + 2520*1 + 5n«i22 + • • •, 
Pi — *2 + 25 0 2*2 + 5i \Z\Zi + • ■ •, 

and the inverses (Problem 15-2) 

*1 = Pi — 26 2 oPi — 5nPiP2 + • • •, 
Zi = pi — 25q2p 1 — 5nPiP2 + • • • • 


(15-46) 


(15-47) 


These last equations also provide series in the activity coefficients 
?i = zi/pi and y 2 = zi/Pi- 

Substitution of Eqs. (15-47) in Eq. (15-44) gives the desired virial 
expansion 

= Pi + Pi + Bio(T)Pi + Bn(T)PiPi + B 0 i(T)p2 + • • •, (15-48) 
where 

2?20 = —520. #11 = ”5n, B oa = —b 0 i- (15-49) 


The coefficients B 20 and Z? 02 are just the second virial coefficients of the 
two pure gases (Section 15-1); Bn is new and depends on the properties 
of two molecules in the volume V, one of each species. 

For a classical monatomic gas mixture, we have from Eq. (6-25) that 


where 


n _ Z NlNt 

~ JVjlJValAjWtAl* ’ 


(15-50) 


Z N i Ni = Je- UN ' NilkT d{N 1 ) d{Ni), (15-51) 

r 


and d{iVi} means dt x ... dr Nl for species 1, etc. Comparison with Eq. 
(15-42) shows that, in this special case, Z^ l N i defined in Eq. (15-42) is 
the classical configuration integral, (15-51). Then, in Eq. (15-48), B 20 
and Bq 2 are given by Eq. (15-24) for a pure gas [using the appropriate 
u(r) for each gas] and 

Bu = — 6n = — y (Z\\ — y 2 ) 

= - _ 1]4xt 2 dr. 

Jo 


(15-52) 
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In this equation, Uu is the intermolecular potential between one molecule 
of each type. Information about u u can be deduced from very accurate 
measurements on the two pure gases and on the mixture, since only B lt 
in Eq. (15-48) is not a property of a pure gas. If a Lennard-Jones poten¬ 
tial is used for each of the three pair interactions, a good approximation 
for the parameters of uu, in the absence of other information, is 

«n = («i«a) 1/2 » rJi = £( r * + r|), (15-53) 

where ej, e 2 , r* and r\ refer to the pure gases 1 and 2. The first relation 
follows from the simple theory of dispersion forces, and the second would 
be correct for hard spheres. 

The above equations provide a summary of the statistical-mechanical 
relations that determine the thermodynamic properties of a slightly im¬ 
perfect binary gas mixture. The (quantum-mechanical) interactions be¬ 
tween the three kinds of pairs of molecules are the basic quantities ap¬ 
pearing in these equations. 

Next we investigate briefly an example of a chemical equilibrium oc¬ 
curring in an imperfect gas mixture. It will be recalled that Chapter 10 
was restricted to chemical reactions occurring in ideal gases. Suppose 
that in the binary mixture 1, 2 the dissociation (or association) equi¬ 
librium, 

©^2® , 

as in Section 10-2, takes place. The equilibrium condition is /i 2 = 2jui, 
or X 2 = Xf. Then from the definitions of z\ and z 2 in Eq. (15-41), 

z„V = z\V 2 
Qoi Qio 


or 


zj ( Qio/V ) 2 
z 2 (Qoi/n 


K{T). 


(15-54) 


Here K(T) is the same kind of equilibrium constant as in Eq. (10-6), and 
the Q’a have the same meaning as the g’a in (10-6). However, the equi¬ 
librium constant is equal to a quotient of activities rather than to a quo¬ 
tient of concentrations. The fact that z*/z 2 is a function of T only, K(T), 
is of course a purely thermodynamic result. But the relation of K(T) to 
the Q’b is statistical-mechanical. 

We can also write 


.2 n 2 -V 2 

K{T) - 

Z 2 P 2'2 


(15-55) 
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where 7i and 7 2 are activity coefficients, determined by Eqs. (15-47). 
Thus, 

K(T) = ^ [1 4- (&n — 46 2 o)Pi + •••]• (15-56) 

Pi 

The expansion in brackets here gives the first-order correction term, as¬ 
sociated with gas imperfection, to the ratio pf/p 2 [which is exactly equal 
to K(T) only in the limit as pi, p 2 —* 0]. The linear term in p 2 is omitted 
because it is of order pf. This expansion is thermodynamic in origin if we 
regard the 6,7 as being determined from the experimental equation of 
state, (15-48), and the relations (15-49). But Eqs. (15-45) for the 6,7 
are of a molecular, not thermodynamic, nature. 

A special case of the equilibrium © *=± 2©, above, is the formation 
of dimers, or clusters of two molecules, in a slightly imperfect one-com¬ 
ponent gas. Of course, from a thermodynamic point of view, any dimer 
formation, however defined, is automatically taken care of by the second 
virial coefficient B 2 and need not be discussed explicitly. But for some pur¬ 
poses it is profitable to adopt the alternative but necessarily equivalent 
point of view that dimers exist and are in equilibrium with monomers. 
We shall say something about the definition of a dimer below. Higher clus¬ 
ters (trimers, etc.) can be included in the treatment, but for simplicity we 
confine ourselves to monomers and dimers (i.e., the gas is assumed very 
dilute). Incidentally, we are referring here to “real” or “physical” clusters, 
not the “mathematical clusters” of Section 15-2. A much more detailed 
treatment of clusters of both kinds will be found in S. M., Chapter 5. 

The gas is considered an equilibrium mixture of two species: mono¬ 
mers, ©, and dimers, ©. The partition function of one monomer in V is 
Q 10 ; of two monomers in V is Q 20 (the two monomers can interact with 
each other); and of one dimer in V is Q 0I . The concentration of mole¬ 
cules in the gas, from the one-component point of view, is 


P = Pi + 2p 2 , 


where 


= K(T) 
P 2 


Q10 

VQ 01 ' 


(15-57) 


(15-58) 


The extra term given in Eq. (15-56) is not involved in B 2 , so we drop it 
here. From the above two equations we find 
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or, for the concentrations of monomers and dimers, 

Pi = P - | P 2 , (15-59) 

p 2 = (15-60) 

to terms in p 2 . If we substitute Eqs. (15-59) and (15-60) in Eq. (15-48) 
for a binary gas mixture, we get for the equation of state 

^ = P + (- g - 5 2 °)p 2 + • • •, (15-61) 

where 

- g - - r»]. (16-82) 


This should be the second virial coefficient B a of the one-component gas. 
From Section 15-1, 

<1MS) 

Since Qi and Q i0 have the same meaning, we conclude from (15-62) and 
(15-63) that, for self-consistency, we must have Q 2 = Q 20 + Q 01 - But 
this is just what we should expect since, from the monomer-dimer point 
of view, two molecules in V can exist in two sets of states: (a) two mono¬ 
mers; and (b) one dimer. As far as thermodynamics is concerned, the 
splitting of Q 2 into two parts, Q ao and Q 0 i, is quite arbitrary; it depends 
on the definition of “dimer” (but the same B a is found in any case, pro¬ 
vided Q 20 + Qo\ = Qi)- Hence the numbers of monomers and dimers 
calculated by Eqs. (15-59) and (15-60) are also arbitrary. The most ob¬ 
vious classification for the states of Q 2 is to include the “bound” states 
(in quantum or classical mechanics) in Q 0 i and all others in Q a o- The 
reader interested in pursuing this subject further should consult S. M., 
Section 27. 


15-4 Imperfect gas near a surface. In this section and the next we con¬ 
sider a one-component imperfect gas in an external field. Here the ex¬ 
ternal field is a short-range one provided by a solid surface, as, for example, 
in Eq. (7-2). The present treatment provides a relatively exact (compared 
to Chapter 7) approach to the problem of physical adsorption of gases on 
solids. We shall merely sketch the basic equations here. Further details 
are available elsewhere.* 

* S. M., Appendix 10, where additional references are given. Also T. L. Hill, 
J. Phys. Chem. 63, 456 (1959). 
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r 


0 >) 

Fig. 15-3. Gas in volume V (a) in the absence and (b) in the presence of 
an adsorbing surface. 

The system of interest is a one-component gas in a volume V in the 
absence (Fig. 15-3a) and in the presence (Fig. 15-3b) of an adsorbing 
surface of area a. In both cases the gas has a chemical potential p (or 
activity z) and temperature T. If Tt is the average number of molecules 
in the system with solid present and if TJ° is the average number with 
solid absent, then the number of adsorbed molecules, by Gibbs’ surface 
excess definition, is N — 37°. The basic thermodynamic equation for the 
system with the solid present is [compare Eq. (7-6)] 

dE = TdS - vda - pdV + pdN, (15-64) 

where p is the gas pressure far from the surface. Then it follows that 

S(m, v, a, T) - e (pV+ra)/kT = X) Q*( y . a. ■ ( 15 - 65 ) 

N 

In the absence of the solid, we use the notation 

S°(m, V, T) = e pYlkT = Y, (&( y > r >^’ d 5 - 66 ) 

AT 

where 2° and Q% have the same meanings as 2 and Qn in Section 15-1. 

Although we could pursue this problem using the general formulation of 
Section 15-1, let us investigate instead the specific case of a classical mon¬ 
atomic gas. The potential energy of the gas (when the solid is present) 
is the sum of the usual (Section 15-2) intermolecular potential energy U% 
(solid absent) and the potential energy of the gas molecules in the field 
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of the solid, which we denote by U%. The energy U% is made up of a cum 
of N separate contributions of the form tt*(r), the potential energy of a 
single gas molecule at r in the field of the solid [see, for example, Eq. (7-2)]. 
Then 

Qn ~ = n\\sn ’ (15-67, 

where 

Z% = f e - u " lkT dt , • • • dr at (15-68) 

v 

and 

Z N = J e - (U » +u » )lkT dt!--- dr N . (15-69) 

v 

If we define the activity by z = = X/A s , then 

s = <>«0) 

AT iV ‘ N lyl 

We now follow the same argument as in Section 15-1, and find 

g - r £ »V, = v £HMD 

,>x >>i 

37° = 7 X) 37 = 7V 

i>l j'£i 

37 - 37° = y £ (15-72) 

where the 6’s are related to the corresponding Z’b by Eq. (15-8) (but 

Z i ^ y, 6i 9 * 1), and p(z, T) is the pressure either in the absence of or 
far from the surface. Equation (15-72) is the “adsorption isotherm” 
giving the amount adsorbed as a function of the activity z (or fugacity, 
/ = zkT). At very low gas pressures (the Henry’s law region), p/kT — z 
and the number of molecules adsorbed is 


37 - 37° = y(6i - &?)z = y(6i - i) 

where 

y(6, - i) = z x - V = j[ e - Vi * wlkT - i] dt. 
v 

If u* is a function of the distance £ from the surface only [as in Eq. (7-2) 
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and Problem 7-1], we have 

~wl I 6 "”'"*’' - >1 ( 1M3 > 

The integral determines the relative extent of binding of different mole¬ 
cules on a surface, and this in turn depends on w*(f). As is typical of a 
Henry’s law constant (Chapter 19), the constant is determined by the 
interaction of just one “solute” (gas) molecule with the “pure solvent” 
(solid surface). Equation (15-73) has been applied successfully to ex¬ 
perimental data by Freeman and Halsey, f 
It is left to the reader to obtain explicit expressions (in terms of u* and 
u) for the quadratic terms in the expansion of 37 — F° in powers of z or 
p/kT. 

From Eqs. (15-71) and (15-72), the surface pressure in the limit as 
p -* 0 is given by 

= F(6 X - 6?)* = Tf - F°, (15-74) 

which has the form of the equation of state of a two-dimensional ideal gas. 
Note from Eq. (15-73) that it is possible for N — 77° and <p to be negative. 
Incidentally, if we subtract 

dE° = TdS° - pdV + pdN 0 

from Eq. (15-57), we get the basic thermodynamic equation, in terms of 
surface excesses, 

dE, = T dS, - <p da + p dN„ (15-75) 

where 

E, = E- E°, S. = S-S°, N, = N — N°. 

This is the analog of Eq. (7-6). 

15-5 Imperfect gas in an electric field. The discussion here is a con¬ 
tinuation of that in Sections 12-1 and 12-2, which should be reviewed. 

Section 12-3 was concerned with a dilute gas of independent molecules 
in an electric field. In an imperfect gas or liquid, the molecules are not 
independent and the problem becomes very involved. Each molecule is 
not only in the field (D) of the external charges, but is also in the field of 
the other molecules. These intermolecular interactions are more compli¬ 
cated than usual because (a) dipole-dipole forces have a relatively long 


t M. P. Freeman and G. D. Halsey, Jr., J. Phys. Chem. 59, 181 (1955). 
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range, and (b) both the external field and the field due to other molecules 
induce an additional dipole in a given molecule, which in turn contributes 
to the interaction of the molecule with other molecules. Superimposed 
on these electrostatic interactions are the usual van der Waals attractions 
and repulsions (themselves ultimately, of course, of electrostatic origin). 

Although the properties of a dielectric liquid in an electric field are 
extremely difficult to analyze without approximation, use of the grand 
ensemble and low-density expansions make it possible to treat an im¬ 
perfect gas in a straightforward way. We confine ourselves here to slightly 
imperfect gases (up to the second virial coefficient), and even in this case 
we develop general equations only. Detailed treatment of realistic special 
cases or models involves rather complicated calculations.* 

The grand partition function, (12-15), can be written 

S = 1 + Qi(F, T, D)X + Q 2 (V, T, D)\ 2 + ■■■. (15-76) 

Then 

= InS = <2iX + (O 2 — £<2i)X 2 + • • • (15-77) 

and, from Eq. (12-17) (note the definition of Mm), 

TR = M ^X + (Q 2 M 2 - + • • •. (15-78) 

Also, from Eq. (12-9), 

- X = QiX + (2 Q 2 - Q 2 )\ 2 + • • -. (15-79) 

For a very dilute gas (p, p, or X —* 0), only the leading terms in Eqs. 
(15-77) through (15-79) need be retained. For a slightly imperfect gas, 
for which a second virial coefficient is required in the equation of state, 
Eqs. (15-77) through (15-79) indicate that we have to consider the proper¬ 
ties of only two molecules in the volume V and external field D. Of course, 
in this case, interactions (including dipole-dipole) between the two mole¬ 
cules, and the influence of D on the interactions, have to be taken into 
account. 

From Eqs. (15-77) and (15-79), we can obviously derive, just as in Sec¬ 
tion 15-1, the virial expansion 

j§i = P + £ B»(T, D)p n , (15-80) 

n>2 

* See A. D. Buckingham, J. Chem. Phys. 23, 2370 (1955); A. D. Buckingham 
and J. A. Pople, Trans. Faraday Soc. 51, 1029, 1179 (1955). 
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where the B n and Qn are related by the same formal expressions as in 
Section 16-1. Also, we can show* (Problem 15-3) that the virial expan¬ 
sion for the polarization is 


P = 


E 

v 




(15-81) 


Problems 

15-1. Obtain an expansion for f/kT (/ = fugacity) in powers of p/kT up to 
the cubic term. (Page 265.) 

15-2. Deduce the expansions (15-47) for a binary mixture from (15-46). 
(Page 275.) 

15-3. Use Eqs. (15-77) through (15-79) to verify the coefficient of p 2 in 
the polarization expansion, (15-81). (Page 283.) 

15-4. Estimate values of e and r* for ethane from T c = 305.2°K and 
p e = 48.8 atm. 

15-5. The intermolecular potential for monatomic molecules, 


u(r) 



where a, P, and 7 are positive constants, is often used, (a) If the system (gas) 
obeys the law of corresponding states, what conclusion can be drawn about 
the independence or dependence of a, P, and 7 on each other? (b) If this poten¬ 
tial is made to agree with the Lennard-Jones potential for large r and in the 
location of the bottom of the well (u — —e = minimum at r — r*), what are 
the connections between a, p, and 7 and r* and «? Calculate a, P, and 7 for 
argon. 

15-6. Sketch a plot of the function e~ MWkr — 1 for hard spheres and for 
the Lennard-Jones potential (at several different temperatures). 

15-7. Find B 2 at high temperatures for the potential 


u(r) = +00 r < r* 



r > r*. 


That is, for r > r*, replace e -wir by 1 — (u/kT). 
15-8. Show, using an integration by parts, that 

is equivalent to Eq. (15-24). 


* T. L. Hill, J. Chem. Phys. 28, 61 (1958). A number of other details are 
also given in this paper. 
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15-9. Extend the series of Section 15-1 one more term. 

15-10. Show that the expansion of E/NkT in powers of p for a classical 
monatomic gas starts off as 


NkT “ 2 + 2J wj 0 “ (r)e ° *" dr 4 ’ 

and that a more general relation is 

E 3 . T dbj j 
NkT ~ 2 + j£'dT 


(15-83) 


(15-84) 


15-11. For a classical monatomic gas, put S/Nk at given p and T in the form 


(ideal gas, same p and T) + C(T)p H-. 

Find C{T) in terms of «(r). Show that for a finite temperature, C is always 
negative (as might be expected intuitively). Find the asymptotic form of C 
at high temperatures. Use the u(r) in Problem 15-7 to calculate C at high 
temperatures. 

15-12. Use Problem 15-4, critical constants for methane, and Eqs. (15-53) to 
estimate «ii and rfi for the ethane-methane interaction. 

15-13. Extend Eqs. (15-43) through (15-49) to include the third virial 
coefficient. 

15-14. If Eq. (15-48) for a binary mixture is put in the form 
Jjp “ P + B 2 P* + • • •, P — Pi + P2, 

show that 

B 2 (xi, T) <= Bio{T)x\ + Bn(77*i(l — *i) + Bo2{T){\ — xi) 2 , 

where xi is the mole fraction of component 1. 

15-15. Consider the isomeric equilibrium © (T) in a binary gas mixture. 

Find the analog of Eq. (15-56). If the equation of state is written as in 
Problem 15-14, show that 

n _ B 2 qK 2 + B U K + B 02 

(1 + K )2 

15-16. Define the surface concentration r — (TJ — 37°)/Ct. Extend Eq. 
(15-74) to read 

§ = P + B 2 (T)T 2 + • • •, 

and find an expression for B 2 in terms of u* and u. 

15-17. Relate the thermodynamic functions E„ S„ <p, and N, in Eq. (15-75) 
to partition functions. 



Supplementary Reading 


Fowler and Guggenheim, Chapter 7. 
Hirschfelder, Curtiss, and Bird, Chapter 3. 
Mayer and Mayer, Chapter 13. 

S. M., Chapter 5. 



Copyrighted Materials 

Copyright© 1986 Dover Publications Retrieved from www.knovel.com 


CHAPTER 16 

APPROXIMATE CELL AND HOLE THEORIES 
OF THE LIQUID STATE 

The general series expansion approach of the previous chapter is exact 
but in practice can be applied only to dilute gases. The treatment of dense 
gases requires higher virial coefficients, and it will be recalled that an 
investigation of the virial coefficient B n involves an n-body problem. Al¬ 
though the series method runs into computational difficulties when applied 
to dense gases, it breaks down altogether when liquid is present in the 
system; that is, there is a singularity in the function p(p) at the condensa¬ 
tion point, as already explained in connection with Eq. (15-1). 

Therefore we have to turn to new techniques in studying the liquid 
state. Any exact approach to this problem involves, in one form or another, 
the treatment of a system containing of the order of, say, 10 2 ° molecules. 
For example, in the canonical ensemble, the integral (6-22) with N — 
0(10 2 °) must be evaluated. Because of this possibly insuperable difficulty,* 
approximate theories of the liquid state are of great interest. We discuss 
approximate cell and hole theories in this chapter, and the distribu¬ 
tion function method in the next chapter. We restrict ourselves to a 
classical monatomic system with pairwise additive potential energy. 
Section 6-2 gives the criteria for the use of classical statistics. As usual, 
our object is to provide an introduction to the subject and not to review the 
latest refinements. 

We begin by discussing the van der Waals equation of state. This 
theory is of interest because of its simplicity, its prediction of a phase 
transition, and its resemblance to the Bragg-Williams lattice gas approxi¬ 
mation (Section 14-4). 

16-1 The van der Waals equation of state. If no forces act between 
the molecules of the system, 

a N V 

Q = h’ «-zr (16 - 1} 

as we found in Eq. (4-12). Here the partition function q pertains to a 


* High-speed computing machines seem to be the only hope in this connec¬ 
tion. Model systems with of the order of several hundred particles have been 
examined so far. See, for example, B. J. Alder and T. E. Wainwbioht, J. Chem. 
Phys. 31, 459 (1959), where other references are given. 
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single particle moving in a volume V which is potential-free. In order to 
derive the van der Waals equation, we assume that in a fluid of inter¬ 
acting molecules, each molecule moves, independently, in a uniform po¬ 
tential field provided by the other molecules, these being distributed in a 
random manner (hence the analogy with the Bragg-Williams theory). 
The potential energy between a pair of molecules is taken to be (see 
Problem 15-7) 

w (r) = +00 r < r* 

= -«(£)* r > r*. (16-2) 

This interaction potential has a “hard-sphere core” and the usual r ~ 6 
attractive term for large r. The minimum occurs at u = — e, r = r*. 

Because of the assumed independence of the molecules, we still have 
Q = q N /N\ but we make two modifications in q, both arising from inter- 
molecular forces. First, because of the hard-sphere core in each inter- 
molecular interaction, not all of the volume V is available for the motion 
of a given molecule. We therefore replace V by a “free volume” Vj. 
Second, we insert a Boltzmann factor e~‘ pl2kT to take care of the inter- 
molecular potential field in which the given molecule is moving. The 
energy <p, a function of N/V, is the potential energy of interaction between 
any one molecule and all others in the system. The factor of two is inserted 
in the exponent because each pair interaction has to be shared between 
two molecules in counting up the total potential energy. Thus we write 

a N V,e-* lUT 

Q ~ jy] ’ 5 (16-3) 

Now we turn to the explicit forms to be used for <p and V/. In the 
neighborhood of a particular molecule, the density of other molecules will 
be zero between r = 0 (location of the specified molecule) and r = r*, 
because of the hard core, and will be constant at the value N/V from 
r = r* to r — oo, because of the assumed random distribution of mole¬ 
cules. (Actually, the distribution is not random and the density not 
constant—see Chapter 17.) Between r and r + dr, for r > r*, there are 
(N/V)-4rr 2 dr other molecules. The potential energy of interaction be¬ 
tween each of these and the central molecule at r = 0 is given by Eq. 
(16-2). Therefore 



<P = 


2dvN 

~\T’ 


(16-4) 
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As the particular molecule of interest wanders through the volume V, 
a volume (4ir/3)r* 3 is excluded to it by each other molecule in the system. 
However, we have to divide this quantity by two for the same reason as 
above: the excluded volume arises from an intermolecular pair interaction, 
(16-2), and only half of the effect can be assigned to a given molecule. 
This argument will be confirmed by a direct calculation of B 2 below. 
Therefore we write 

2irr* 3 

Vf = V — Nb, b = • (16-5) 

Use of the volume Nb in Eq. (16-5) implies nonoverlapping, i.e., addi¬ 
tivity, of excluded volumes. This can only be correct in the limit of low 
densities, but we use this expression for V; at all densities, as an ap¬ 
proximation. 

Equations (16-3) through (16-5) now furnish us with a complete canonical 
ensemble partition function from which the various thermodynamic 
properties can be deduced. For example (Problem 16-1), 

- gy - In Q= ~ N ' n N + N, (16-6) 

and 

(p + (V - Nb) = NkT. (16-7) 

Equation (16-7) is the van der Waals equation of state. Thus the con¬ 
stants a, and b, defined in Eqs. (16-4) and (16-5), are the usual van der 
Waals constants, but here they are given expression in terms of the param¬ 
eters £ and r* of the intermolecular potential function. The subscript is 
included in a, to avoid confusion with a in Eq. (16-24). 

If we expand Eq. (16-7) in powers of 1/v — p = N/V, we obtain the 
virial expansion 

= P + (b — P 2 + b 2 p 3 + 6 3 p 4 + • • • (16-8) 

The second virial coefficient is B 2 — b — ( a v /kT ). If we multiply Eq. 
(16-8) by b and let 0 — pb (the maximum density corresponds to 6 = 1), 

+ + + < 1WI > 

This is very similar to Eq. (14-45) in the Bragg-Williams theory. 

The second virial coefficient, calculated from Eq. (15-24) and the 
potential (16-2), agrees in the limit of high temperatures (see Problem 
15-7) with B 2 in Eq. (16-8). This is to be expected, since the random 
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molecular distribution assumed in calculating <p is approached at high 
temperatures, and the excluded volume correction in Vf is correct if the 
density is low enough so that only pair interactions need be considered. 

The fact that the van der Waals equation exhibits critical and phase- 
transition behavior as in Figs. 14-4 and 14-7(b) is well known. This is, 
of course, the reason for our interest in the equation: it is mathematically 
very simple, but still predicts a first-order gas-liquid transition. The 
critical constants, from Eq. (16-7), are 


v e = 3 b, 


8 a 
27 bk’ 



(16-10) 


or, in terms of the parameters e and r*, 


kT e 


JS5 - 2 * = 6.28, ^ ^ = 0.296, 


Pc** ... 1 AA1W P&C 3 AAAB 

= 18i = 00177 ' fefe 8 0375 - 


(16-11) 


The individual values of the critical constants in (16-11) are in very poor 
agreement with experiment (Table 15-1). Hence we cannot take the 
van der Waals model seriously in a quantitative sense, though it is useful 
for qualitative purposes. 

Because of the form of the potential (16-2) [see Eq. (15-33)], we expect 
the van der Waals equation to obey the law of corresponding states. This 
is easily verified if we rewrite Eq. (16-7) as 


P» _ (v/b) _ a, 

kT ~ (v/b) — 1 (v/b)bkT 


(16-12) 


Now b = 2irr* 3 /3 and a v /bkT = f/kT. Therefore, pv/kT is a universal 
function of v/r* 3 and kT/e, as in Eq. (15-32). 

The two-dimensional van der Waals equation is of considerable interest 
as an approximate equation of state for an adsorbed monolayer. By the 
same reasoning as above (Problem 16-2), we find the equation of state 


(” + $■) (a “ Nb>) = NkT > < 16 ~ 13 > 

where 

0 ' = ™_, 5 ' = ^-- ( 16 - 14 ) 


If the critical properties of the same gas [with «(r) given by Eq. (16-2)] are 
compared in three dimensions and in two dimensions, we find T e (3 dim)/ 
T«(2 dim) = 2 (Problem 16-3). This result has been confirmed approxi¬ 
mately in several adsorption systems. 
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16-2 Cell theories of liquids. In a cell theory of liquids, we imagine 
the volume V divided up into a lattice of N cells, with one molecule in 
each cell. The motion of each molecule, within its cell and in the potential 
field of its neighbors, is assumed independent of the motion of the other 
molecules. This resembles the Einstein model for a crystal (the reader 
should review the beginning of Section 5-1), except that we shall be using 
classical mechanics here and the potential is not necessarily parabolic. 
The conversion of a cell model for a crystal into a cell model for a liquid is 
accomplished rather artificially by the introduction of the so-called 
“communal entropy”—absent in a crystal, present in a gas, and assumed 
also present in a liquid. Let us digress at this point to explain what is 
meant by the term “communal entropy. ” 

In a crystal each molecule is confined in a “cell” or “cage” formed by its 
nearest neighbors, and is only very rarely involved in excursions outside 
the cell. At the other extreme, in a dilute gas, each molecule is quite free 
to wander over the entire volume V of the container. Because of this 
additional freedom, gas molecules are said to have “communal entropy” 
not possessed by molecules in a crystal. The liquid state is intermediate 
in nature, and it is not at all obvious to what extent the liquid state pos¬ 
sesses communal entropy. Originally, Hirschfelder, Stevenson, and 
Eyring* assumed that the liquid state had essentially the complete com¬ 
munal entropy and that the communal entropy therefore appeared on 
melting as a large part of the entropy of fusion. This view was later 
criticized by O. K. Rice.f It can safely be said that the situation with 
regard to the communal entropy in the liquid state remains obscure even 
at present, although Kirkwood has given the concept rigorous formal 
definition. 

We now consider the simplest possible illustration of communal entropy. 
Suppose that we have a system of N monatomic molecules, without inter- 
molecular forces, in a volume V. Then, as in Eq. (16-1), 

V N ve 

A = -kT In Q = -kT In = —NkT In . (16-15) 

and 

3/2 

S = Nk In ^g- + Nk. (16-16) 

On the other hand, suppose, by the use of hypothetical partitions, that the 
volume V is divided up into N cells, each of volume v = V/N, and that 
each cell is occupied by a molecule which is restricted to move inside the 


* J. Hirschfelder, D. Stevenson, and H. Eyring, J. Chern. Phys. 5, 896 
(1937). 

t 0. K. Rice, J. Chem. Phys. 6, 476 (1938). 
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cell. Again we assume that there are no intermolecular forces. From the 
communal-entropy point of view this situation resembles that in a crystal. 
The configuration integral is now v N instead of V N ) also, the factor 1 /N\ 
in Eq. (16-15) is omitted, since the molecules are now distinguishable 
(the cells can be labeled). Hence 

A = -kT In <3 = —kT In ^ = -NkT In p, (16-17) 

and 

3/2 

S = Nk In ■ (16-18) 

Equations (16-15) and (16-16) may be compared with Eqs. (16-17) and 
(16-18). It is clear that if we start with the system divided up into cells 
and then remove the partitions, the system acquires in this process the 
additional “communal" entropy, AS = Nk. 

Now suppose that we take into account, very roughly, the inter¬ 
molecular forces in the cell model described above (for a crystal) by 
assuming that each molecule moves in a field of constant potential <p 
within its cell, where <p arises from the interaction of a given molecule 
with all the other molecules of the system. The only effect this will have 
on thermodynamic properties is to raise the energy of the system by N<p/2. 
Equation (16-17) becomes 

A = —NkT in JE + ^ ■ (16-19) 


As a next step, suppose the interaction potential between a given molecule 
and all other molecules is not assumed constant, but rather is a function of 
position in the cell, <p(r), where the origin r = 0 is located at the minimum 
in <p. For example, owing to intermolecular repulsions, we might expect <p 
to become very large in a condensed phase when the confined molecule 
approaches its neighbors at the edge of its cell. The probability of ob¬ 
serving the central molecule in a given element of volume is no longer 
uniform throughout the cell but must be obtained from a Boltzmann 
factor. As a result of this, the “effective” or “free” volume through which 
the central molecule can move is reduced to a value less than v. In fact, 
the cell configuration integral leading to Eq. (16-17), 

v = J dr, 


where A represents the cell, must now be replaced by 


v/ 



€ -[»(r)-»(0)I/*r 


dr, 


(16-20) 
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where jy is the “free” or “effective” volume. Equation (16-19) is then 
modified to read 

A = -NkT In g + • (16-21) 

In this picture we would expect #>(0) to be a function of p or v, and ty to 
be a function of v and T. This equation is very closely related to Eq. (6-3) 
for an Einstein crystal: iy/A 3 corresponds to q 3 in Eq. (5-3). The differ¬ 
ence is that (a) ty/A 3 is a classical partition function, and (b) the potential 
well inside the cell is not assumed to be parabolic in Eq. (16-20) (i.e., we 
are not limited to small vibrations), as it is in the Einstein model. 

Finally, to make Eq. (16-21) applicable to a liquid, we replace ty by 
tye, just as t> in Eq. (16-17) is replaced by ve in Eq. (16-15). That is, we 
assume the liquid has communal entropy since, in a liquid, a given molecule 
is not confined to a particular cell but can wander over the entire volume 
V. As already mentioned above, this is a strictly intuitive step which is 
found on more careful analysis to be inaccurate. Thus, for a liquid, 

A = -NkT In V/ ^P- + N<e %’ V) • (16-22) 

Before going on to a particular application of Eq. (16-22), we note that 
the van der Waals free energy, Eqs. (16-3) and (16—6), although not based 
on a cell model, can be put in the form of Eq. (16-22) with 

v/(v) = - p - b, <p(0;v) = (16-23) 

The van der Waals ty is a function of v but not T. 

We now turn to the cell theory of Lennard-Jones and Devonshire 4 ' (UD), 
based on Eq. (16-22) as a starting point. Many refinements of this theory 
have been worked out, but they have not changed the position of the UD 
theory as the basic prototype for cell theories of liquids. 

In the UD model, each molecule moves within its cell in the potential 
field of its nearest neighbors assumed fixed at the centers of their cells. 
To simplify the problem, the c nearest neighbors are treated as uniformly 
“smeared” over a spherical surface of radius a, where a is the distance 
between the centers of nearest-neighbor cells. Then 

o 8 = yv, (16-24) 

where 7 is a numerical constant (not an activity coefficient here) depend- 


* J. E. Lennard-Jones and A. F. Devonshire, Proc. Ray. Soc. {London), 
A163, 53 (1937); A165, 1 (1938). 
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Fig. 16-1. Cell geometry for the Lennard-Jones and Devonshire theory. 


ing on the geometry of the lattice. For a face-centered cubic lattice, 
7 = -\/2 and c = 12. Our task is to calculate (Eq. 16-20) 

v, = £ e~* w/kT iirr 2 dr, (16-26) 

where 

*(r) = <p(r) - <p(0). (16-26) 

In Fig. 16-1, the central molecule is at P, a distance r from the center of 
the cell. The area of the ring shown on the surface of the sphere is 

2jto 2 sin 8 dd. 

The number of "smeared” nearest neighbors in this area is 


2ira 2 sin 8 d8 


£ sin 8 dd, 


and the potential energy of interaction between the molecule at P and the 
neighbors in the ring is 

u(R) • § sin 8d8, 

where 

R 2 = r 2 + o 2 — 2or cos 8. 

Hence the total energy of interaction between the molecule at P and all 
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of its c neighbors is 


<p(r) = | J u(E) sin 6 dd, 


where we use the Lennard-Jones potential 


u(R) = —2e 




(16-27) 


(16-28) 


We substitute Eq. (16-28) into Eq. (16-27), and obtain, on carrying out 
the integration, 

+M “ <■« - *(») = “ [(£)'X5) - 2 (£)' m (ii)] ’ <lfr ‘ 29) 

where 

»(0) - «[-2(£)' + ($)"]’ 

l(x) = (1 + 12x + 25.2x 2 + 12x 3 + x 4 )(l - x)" 10 - 1, (16-31) 


m(x) = (1 + x)(l — x) 4 — 1. 


If we define 


•t .TO 

* p *3 r 

« = -r r == — > 

o 8 y 


(16-32) 


(16-33) 


Eqs. (16-29) and (16-30) become 


_ Hr) - HO) - «[(?)' (£) ~ 2 (tJ ”©]' <16 - 34) 
HO) = «[-2 (£)’ + (j)‘] • (16-35) 


When v/v* is small (< ~1.6), ^(r) has a minimum at r = 0 and rises 
rapidly as r increases. For large v/v*, as should be expected from the 
physical model, >p(r) has a low maximum at r = 0 , a minimum near 
r — a — r*, and rises rapidly when r > a — r*. 

The potential ^(r) in Eq. (16-34) is substituted in Eq. (16-25), and we 
find (putting y = r 2 /a 2 ) 

Vf = 2 ira 3 0 , (16-36) 

g = l exp (- ~ 2 (OH ) 2/1/2 dy - 
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The upper limit s in the integral is determined by the “boundary condi¬ 
tion” that, when ce/kT = 0 (ideal gas), 



Actually, the results for the critical and liquid regions are quite insensitive 
to the choice of «. 

Equations (16-22), (16-35), and (16-36) now determine all the thermo¬ 
dynamic properties of the system. For example, for the equation of state, 
we find (Problem 16-4) 


pv _ 
kT ~ 

2ce 
1 kT 

[©'-01 


-m 

(16-37) 

where 






91 = 

L “ p l 

-S[(S)V 

- 2 (?)”“] 

I y ll2 l(y) dy 

(16-38) 

and 






9m = 

l “ p ( 

- it[(t) i<b) ' 

- 2 (t)H 

| y U2 m(y) dy. 

(16-39) 


The quantities g, gi, and g m have to be calculated by numerical integration 
for each pair of values of v*/v and ce/kT. Extensive tables of g, g t , g m , 
and several thermodynamic properties have been published by Hirsch- 
felder, Curtiss, and Bird (Supplementary Reading list). The tables given 
actually include the contribution of second- and third-neighbor shells, but 
this refinement turns out to have little effect on thermodynamic properties. 
We can summarize the results as follows: 

(1) Since pv/kT above is a universal function of v/v* and kT/e, the law 
of corresponding states is obeyed. 

(2) A plot of pv*/kT against v/v*, for different fixed values of kT/e, 
shows typical critical behavior and loops of the van der Waals type. The 
critical constants (for 7 = y/2 and c = 12) are 


^ = 1 - 25 , 


kT e 


= 1.30, 


Per * 3 


0.614, 


gr = 0.590. 


(16-40) 
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Comparison with the experimental values in Table 15-1 shows that the 
UD theory predicts the critical temperature very well, but is not so 
successful on the other critical constants [though it is certainly an im¬ 
provement over the van der Waals results, (16-11)]. 

(3) The expansion of pv/kT in powers of v*/v has no linear term, so 
that the second virial coefficient is zero. This is a consequence of the fact 
that, in this model, each cell has exactly one molecule in it at all densities; 
at low densities, with this restriction, binary molecular interactions 
(responsible for B 2 ) become of negligible importance. 

We can conclude that the UD cell theory is completely unsatisfactory 
for a dilute gas and only fairly good in the critical region. But from the 
nature of the model, we would expect it to be quite successful, and it is, at 
very high pressures where the assumption of exactly one molecule per 
cell is reasonable. 

Incidentally, the two-dimensional theory is included in the UD refer¬ 
ences on p. 292. We shall quote just the one result that T c (3 dim)/ 
T e (2 dim) = 1.89, compared with the van der Waals value 2.00. 

16-3 Hole theories of liquids. A rather obvious refinement of cell 
theories of the UD type is to relax the restriction of exactly one molecule 
per cell. The most general approach would be to allow 0,1,2,... molecules 
in a cell. (If the intermolecular potential u(r) is assumed to have a hard- 
sphere core for sufficiently small r, there would be a maximum possible 
number of molecules in a cell of fixed volume.) Much work along these 
lines has been done, but we shall not review this area here. Unfortunately, 
these refinements do not improve the UD theory a great deal. 

A special case of generalized cell theories arises when the cells are 
chosen small enough so that the number of molecules per cell can be only 
zero or one. In this case, it is customary to speak of a “hole theory” of 
liquids—an empty cell being a “hole. ” Strictly speaking, this situation is 
possible only with a hard-sphere core in u(r); but in practice, since the 
Lennard-Jones potential rises so fast as r —» 0, this point can be ignored 
(except at very high temperatures). We consider here an approximate hole 
theory of liquids, due to Cemuschi and Eyring, which is in fact identical 
with the quasi-chemical approximation for a three-dimensional lattice gas 
(Section 14-5). In the lattice gas, a “site” is a cell and an “empty site” 
is a hole. 

We make the following specific assignments of parameters in Sec¬ 
tion 14-5. The lattice is assumed close-packed, so c = 12. The Lennard- 
Jones potential u(r) (with parameters « and r*) is used, and we choose the 
lattice spacing (nearest-neighbor distance between sites) as r*. This is 
perhaps the most natural choice, but is somewhat arbitrary. (A refine¬ 
ment would be to select the lattice spacing so as to minimize A for given 



16-3] 


HOLE THEORIES OF LIQUIDS 


297 


N, V, T.) Then, in Section 14-6, w = —The partition function q refers 
to the motion of a single molecule in its cell, and q = ty/A 3 in the nota¬ 
tion of Section 16-2. Clearly ty should be different for each different 
number and arrangement of nearest-neighbor molecules, but as an ap¬ 
proximation we assume ty is evaluated when all (c = 12) neighbors are 
present. Since the nearest-neighbor distance r* is fixed in this model, q is 
a function of T only. Because we shall consider just the equation of state, 
which does not depend on q(T), we need not pursue this question further 
[but an obvious evaluation of q(T), if needed, would be that offered by 
Eq. (16-36) with a = r*]. 

The equation of state is (14-62), if we put c — 12 and 


$ = 


pV _ pr* 3 
M y/2‘ 


Then, from Eq. (14-63), the critical constants are (Problem 16-5) 


jfe = V'S = 1.414, 



Ms 

kT c 


0.342. 



0.663, 

(16-41) 


Table 16-1 brings together the critical constants of Table 15-1 and Eqs. 
(16-11), (16-40), and (16-41). The hole theory of Cemuschi and Eyring 
is, on the whole, not as good as the UD theory in predicting critical 
constants. 

Equation (14-62) can be rewritten as pv/kT = (1/6) In ( }. Since 
6 = r* 3 /V 2 v and w = — pv/kT is a universal function of v/r* 3 and 
kT/e. Hence this model also obeys the law of corresponding states. 


Table 16-1 

Reduced Critical Constants 



Vc 

r *3 

kT e 

€ 

Per* 3 

€ 

PcVe 

kTe 

Experimental 


1.29 

0.179 

0.292 

van der Waals 


0.296 


0.375 

UD 


1.30 

0.614 

0.590 

Cernuschi-Eyring 

1.41 

2.74 

0.663 

0.342 
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16-4 Law of corresponding states. We have by this time encountered 
several special cases of the law of corresponding states in gases and liquids. 
We give here a brief argument showing how this law can be deduced in a 
general way.* 

Consider the canonical ensemble partition function for a classical 
monatomic gas, Eq. (6-21). Actually, the argument below applies also, 
approximately, to diatomic and polyatomic gases if the angular dependence 
of the intermolecular force is relatively unimportant so that the transla¬ 
tional partition function is separable from rotation, etc., or approximately 
so (in other words, if these molecules are effectively spherically symmet¬ 
rical). This rules out polar molecules, hydrogen bonding molecules, etc. 
From thermodynamics, we know that —A/NkT, being an intensive 
quantity, must be a function of v and T only. Then N~ l In Qn is a func¬ 
tion of v and T only. Therefore, from Eq. (6-21), N' 1 In (Zn/N\) is a 
function of v and T only, call it ti(v, T). That is, 

hi (§]) = NMv, T ), 
or 

§T = e *' N = T ) N ’ (1.6-42) 


where $2 = «**• Now we can rewrite Eq. (6-22) as 


where 


Zn 



U_ _ 
kT 


E 

I 


-UlkT 


“(&) • 


“(*•.;) 

kT 


!*-?**€*)• 


(16-43) 


if we assume a pairwise additive, two-parameter, intermolecular potential 
of the form (15-33), in which e and r* are energy and distance parameters 
(not necessarily identical with those in the Lennard-Jones potential) 
characteristic of each species of molecule. The function h is assumed 
the same for all gases under consideration. Then Zn, from Eq. (16-43), 
can be written 

Zn = r* 3 ^ 3 (j55? ’^ 3 ’ N), (16-44) 


* The original argument of this type is due to K. S. Pitzer, J. Chetn. Phys. 7, 
583 (1939). 
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where ^3 is the same function for all molecules. Next, on comparing 
Eqs. (16-42) and (16-44), we see that V's must in fact have the form 

where ¥ is also the same function for all molecules. This result gives us 


Z N = r* 3N N\*(±, ^ = Q n N\A 3N , 


or 


A 1 . n 
= v?ln Qn 


-P 


* a *(e/kT, v/r* 3 ) 


NkT N 
For the equation of state, we have then 


A» 


]• 


pv v (dA/NkT\ v (d In *\ 

kT ~ r*a V dv/r*3 ) T r *3 \av/r*s)., kT ' 


(16-45) 


(16-46) 


Therefore pv/kT is a universal function of v/r* 3 and kT/e, as was to be 
proved. 

In summary, then, we expect the law of corresponding states to be obeyed 
by those monatomic (or effectively monatomic, as far as translational 
motion goes) classical gases whose intermolecular potentials are pairwise 
additive and of the form (15-33) (all with the same function h). 

In Figs. 15-1 and 15-2 and in Table 15-1 we have seen experimental 
evidence for the existence of this law of corresponding states. A much 
more extensive summary and analysis of experimental data is given by 
Guggenheim.* 

Quantum fluids (e.g., H 2 and He) require an extension of the above law 
of corresponding states. This will be considered in Chapter 22. 


* E. A. Guggenheim, Thermodynamics. 3rd ed. Amsterdam: North-Holland, 
1957. See pp. 165-172. 
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Problems 

16-1. Derive equations for p, E/NkT, and S/Nk as functions of t> and T for 
a van der Waals fluid. Compare these with ideal gas equations. (Page 288.) 

16-2. Derive Eqs. (16-13) and (16-14) for a two-dimensional van der Waals 
fluid. (Page 289.) 

16-3. Show that T c (3 dim)/?, (2 dim) = 2 for a van der Waals fluid. 
(Page 289.) 

16-4. Deduce the UD equation of state from Eqs. (16-22), (16-36), and 
(16-36). (Page 295.) 

16-6. Verify the numerical values in (16-41) for the reduced critical con¬ 
stants of the approximate hole theory of liquids considered in Section 16-3. 
(Page 297.) 

16-6. Find Tb and Ta/T t ( Tb — Boyle temperature) for a van der Waals 
fluid. 

16-7. Find the second virial coefficient for the hole theory of Section 16-3. 
16-8. Show that if pv/kT is a universal function of v/r* 3 and kT/e, then 
pv/kT is also a universal function of v/v t and T/T c . The law of corresponding 
states is often expressed in the latter way. 

16-9. An ideal gas is adsorbed on a surface as a two-dimensional van der Waals 
fluid. Derive the adsorption isotherm. 

16-10. Two simple equations of state often used are: 

Berthelot: (p -f (i» — 6) — kT 

and 

Dieterici: p(v — b) = kTe~ a,llTl '. 


Find the second and third virial coefficients in each case. If each of these equa¬ 
tions is considered to have been derived from Eqs. (16-3), find <p. 

16-11. If the curvature at the bottom of the LJD potential well is used to 
determine the frequency v of an Einstein model, find v as a function of e, t, a, 
r*, and m (mass). Also, find the condition under which the UD potential well 
has a maximum at r — 0 instead of a minimum. 


Supplementary Reading 

Fowler and Guggenheim, Chapter 8. 
Hxeschfeldeb, Curtiss, and Bird, Chapter 4. 
Prigogine, Chapter 7. 

Slater, Chapter 12. 

S. M., Chapter 8. 
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CHAPTER 17 

DISTRIBUTION FUNCTIONS IN 
CLASSICAL MONATOMIC FLUIDS 

An alternative approach to the theory of liquids that has received much 
attention is the distribution-function method. In this chapter we give 
an introduction to the subject, using arguments that are more intuitive 
and transparent than would be the case in a formal and completely rigor¬ 
ous treatment. This type of discussion seems particularly appropriate here 
in view of the fact that a detailed survey of the formal type is already 
available in S. M., Chapter 6. 

For simplicity we restrict the discussion to a classical monatomic fluid 
with a pairwise additive intermolecular potential energy, though the ap¬ 
plicability of the distribution-function method is by no means limited to 
this special case. In fact, as will be intuitively obvious, in systems of ef¬ 
fectively spherically symmetrical polyatomic molecules, the radial dis¬ 
tribution function plays the same role as in monatomic systems. The cor¬ 
responding function in systems of polyatomic molecules that are not 
spherically symmetrical is obtained by integrating a more general, angular 
dependent, distribution function over all rotational orientations. 

17-1 Radial distribution function. Consider a one-component fluid 
with number density p and temperature T. Imagine that an observer is 
stationed on one particular molecule as it moves through the fluid, and 
that he makes observations on the number of molecules found at different 
distances from the (“central”) molecule on which he is sitting. Since the 
system is a fluid, the distribution of other molecules around the central 
molecule will be found, on the average, to be spherically symmetrical. 
The mean number of molecules observed in an infinitesimal element of 
volume dr at a distance r from the central molecule will not be, in general, 
just p dr, because the presence of the central molecule itself perturbs its 
immediate environment. For example, if the molecules of the fluid are 
hard spheres of diameter a, then no other molecule would be observed at 
distances r < a. Or, if the intermolecular potential is of the Lennard- 
Jones type, there would be a tendency for other molecules to accumulate 
at a distance of about r = r*. In any case, at large distances, where the 
influence of the central molecule has died out, the mean number of mole¬ 
cules p dr in dr would be approached. 

In general, we specify the mean number of molecules observed in dr 
by pg(r) dr, where g(r), called the radial distribution function, is the fac- 
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tor by which the mean “local density” pg(r) at r deviates from the bulk 
density p. Because all real molecules become effectively “hard” for r 
sufficiently small, g —» 0 as r —» 0. Also, as indicated above, g —» 1 as 
r —> co. 

The radial distribution function g(r) depends on density and tempera¬ 
ture. Therefore we shall sometimes write it as g(r, p, T). 

The function g(r) cannot be obtained by direct thermodynamic measure¬ 
ment, but fortunately it can be deduced by a nonthermodynamic experi¬ 
mental method: x-ray diffraction. The observed diffraction pattern is a 
sum of interference effects from pairs of molecules. The radial distribution 
function determines the relative weight [proportional to 4xr 2 jr(r)] in the 
sum given to different intermolecular distances. From the observed pat¬ 
tern, one can calculate backward to deduce g(r). Figure 17-1 shows typi¬ 
cal experimental curves in gas and liquid states. The peaks in the liquid 




(b) Liquid 


Fra. 17-1. Typical experimental radial distribution functions for (a) dilute 
gas and (b) liquid. In both cases the high peak occurs near r — r* (Lennard- 
Jones 6-12 potential). 
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curve are smeared-out remnants of relatively sharp peaks in g(r) in the 
solid state (obtained by averaging over different directions outward from 
the central molecule, since spherical symmetry is lacking). These peaks 
in g(r) for the solid correspond, as r increases, to first neighbors, second 
neighbors, etc. Thus the liquid has a certain amount of short-range order, 
about each molecule, which is a residue of the long-range order in the 
corresponding crystal. 

The best single experimental paper available on this subject is an ex¬ 
tensive one on argon.* 

The radial distribution function is of interest in statistical mechanics 
primarily because the thermodynamic functions of the fluid can be ex¬ 
pressed in terms of it, as we shall see in Section 17-2. Because of this, it 
becomes part of the task of statistical mechanics to provide the necessary 
framework from which g(r) can be calculated theoretically, at least in 
principle. This topic is the subject of Section 17-3. Comparison of theoret¬ 
ical g(r)’a with experiment can be made either directly with experimental 
g(r )’s (from x-ray work) or indirectly with experimental thermodynamic 
properties of the fluid. 

17-2 Relation of thermodynamic functions to g(r). The simplest con¬ 
nection to establish concerns the internal energy E. It follows immedi¬ 
ately from Eqs. (1-35) and (6-21) that 

E = §NkT + U. (17-1) 

The first term is the mean kinetic energy, and the second term is the mean 
potential energy. It is very easy to express U in terms of g(r). Consider 
any molecule as the “central” molecule. The total intermolecular potential 
energy between the central molecule and other molecules in the fluid at 
distances between r and r + dr is 

u(r) • pg(r) • 4wr 2 dr. 

Then V is obtained by integrating over all values of r and multiplying by 
N/2, since any of the N molecules might be “central.” The factor of two 
is inserted so that each pair interaction is counted only once. Thus 

Wr = I + 2& l u(r)9( - r > p > dr - (17_2) 

If, for a gas, we suppose g(r, p, T) expanded in powers of p as 

g(r, p,T) = g 0 (r, T) + pg^r, T) + p 2 g 2 (r, T) -\ -, (17-3) 


* A. Eisenstein and N. S. Gingrich, Phyt. Rev. 62, 261 (1942). 
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we see immediately from Eq. (15-83) that 

g 0 (r, T) = lim g(r, p, T) = e~ u{r)liT . (17-4) 

p-*o 

This is just what we should expect, for in the limit as p —> 0, only one 
molecule at a time need be considered in the neighborhood of the central 
molecule, and its spatial probability distribution will be determined by the 
Boltzmann factor (17-4) (see Fig. 17-la). This, incidentally, is equivalent 
to saying that the second virial coefficient involves pair interactions only. 
At higher densities, many molecules are in the vicinity of the central mole¬ 
cule, and the interactions between them influence the distribution about 
the central molecule. 

The expansion (17-3) is, of course, not valid for a liquid. Equation 
(17-2) must be used as it stands. 

Next, we consider the pressure in a fluid. Although there are several 
ways to do this, we calculate the pressure as the force per unit area which 
the molecules on one side of a mathematical surface S in the fluid (Fig. 
17-2) exert on the molecules on the other side. There are two contribu¬ 
tions to this force: the first, px, is associated with momentum transport 
and the second, pu, with intermolecular forces. Since the momentum 
distribution is independent of the existence of intermolecular forces (see 
Section 6-4), the contribution of momentum transport to the pressure is 
just the same as in an ideal gas at the same density and temperature, 
namely, px = pkT. So we have to consider only the second contribution, 
Pu- 



Fio. 17-2. Construction for calculation of pressure in a fluid. 
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In Fig. 17-2, the surface S is the plane x = 0. The force which a mole¬ 
cule at P a distance x from S exerts on a molecule at P' on the other side 
of S is — w'(r) = —du(r)/dr. The force normal to <S is —u'(r)h/r. The 
mean number of molecules in the ring (radius R) with values of R between 
R and R + dR and values of h between h and h + dh is pg(r)2irR dR dh. 
The normal force exerted by the one molecule at P on the molecules in 
the ring is then 

—w'(r) * • pg(r)2vR dR dh. (17-5) 

Now we change independent variable from R to r. Since r 2 = h 2 + R 2 , 
rdr = R dR. Hence (17-5) becomes 

— 2 irpu'(r)g(r) dr h dh. 

Then the normal force exerted by the molecule at P on all molecules on 
the opposite side of S (x < 0) is — 2irp/i(x), where 

/i(x) = u'(r)g(r) dr. (17-6) 


Finally, we will get pu if we add up the contributions — 2wpli(x) of all 
molecules in a cylinder of unit cross-sectional area and with axis perpen¬ 
dicular to S, extending from x = 0 to x = oo. The number of molecules 
in this cylinder between x and x + dx is p dx. Therefore 

p D = -2rp 2 £ I^x) dx. (17-7) 

Equation (17-7) can be simplified. An integration by parts (Problem 
17—1) reduces I\(x) in (17-6) to 

/x(x) = - y{ dr + I / r 2 u'(r)g(r) dr. (17-8) 

This result is substituted in Eq. (17-7), and two further integrations by 
parts are carried out. We find (Problem 17-2) 

pu = —2wp 2 (—$ + b) P r 3 u'(r)g(r) dr, (17-9) 

Jo 

where the two fractions result from the respective terms in (17-8). Finally, 
p — Pk + Pu, and hence 

kf “ p ~ &tL r ^ 9{r,p ’ T) ^ dr - 


(17-10) 
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Fio. 17-3. Radial distribution functions for a gas calculated from the 
Lennard-Jones 6-12 potential (ro is defined in Appendix IV). 

Equation (17-10) is valid for a liquid or gas. If, for a gas, we introduce 
the expansion (17-3) and then compare with Eq. (15-82), we confirm that 
g 0 = Comparison of Eq. (17-10) with Eq. (15-31) for Bg gives 

an expression for g t , but a simpler procedure is to use Eq. (17-20) (Prob¬ 
lem 17-3). Although g 0 has only one peak (at r = r*), as seen in Fig. 
17-l(a), with the inclusion of just one more term (that is, using g = 
ffo + Pffi) a second peak appears in g(r) (Fig. 17-3). 

The last thermodynamic function we consider is the chemical potential. 
This, together with E and p for given N, V and T, determines completely 
all the thermodynamic properties of the system (for example, A and S). 
Of course, if g(r, p, T) is known over the entire temperature range from 
T to T — oo, then A (and hence S and p) can be found by integrating 

( dA/T\ _ 

Kdl/T/jr.v ~ E 

from T to T — oo, using Eq. (17-2) for E and the limit g —► 1 when 
T —» oo (random distribution). But this is not a very practical suggestion 
since ordinarily g(r) is not available as a function of T. 

Instead, we introduce the idea, due to Onsager and Kirkwood, of a 
coupling parameter £ which can vary from £ = 0 to £ = 1. This is 
called a "char ging parameter” in electrolyte theory (Chapter 18). We 
imagine that the strength of the interaction between a particular central 
molecule and all other molecules in the fluid is reduced from its normal 
value by a factor £. [That is, u(r) for these interactions is replaced by 
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(u(r) or, with the Lennard-Jones potential, e is replaced by $€.] This is 
often expressed by the statement that the central molecule is “coupled” 
to the remaining molecules to the degree £. All other molecules interact 
normally with each other. Then the radial distribution function about the 
central molecule will be different than usual, and is denoted by g(r, p, T; £). 
The ordinary g(r) is g(r; 1). Of course, there is no way to measure g(r; {) 
directly, but theoretical equations determining g(r; £) can be derived 
(Section 17-3). 

We write the Helmholtz free energy of the fluid as A — A' + A", 
where A' is the hypothetical Helmholtz free energy the system would 
have (same N,V,T) if there were no intermolecular forces, and A" is the 
contribution of the intermolecular forces to A. Then 


where 


' - (m\. r ~*’ + *"■ 

(17-11) 


(17-12) 

"'-($ : )r.r- Wkr 

(17-13) 


Equation (17-12) follows from Eq. (4-23) or (15-18). The activity coef¬ 
ficient 7 is defined, as in Chapter 15, by z = X/A 8 = 7 p. Since N is a 
very large number, we can write 

M" = (^) r T = A"(N , V, T) - A"(N - 1, V, T). (17-14) 

Because of the relation between the Helmholtz free energy and work in 
thermodynamics, we may conclude from Eq. (17-14) that p" is the iso¬ 
thermal, reversible work that has to be done on the system against inter¬ 
molecular forces in order to add one more molecule to the system. More 
exactly, we mean by the above language that p" is the work done on the 
system (V and T constant) in passing from an initial state containing 
N — 1 molecules fully “coupled” with each other and 1 molecule not 
coupled with any of the others (that is, $ = 0) to a final state with all 
molecules fully coupled ({ = 1). The complete chemical potential 0*) 
is then this work (p") plus a contribution ip!) which depends only on num¬ 
ber density and temperature and not on the presence or absence of inter¬ 
molecular forces. 

We choose the one molecule that is orginally uncoupled ({ = 0) as 
the central molecule. For an arbitrary intermediate value of {, the radial 
distribution function about the central molecule is g(r, p, T; (). The 
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potential energy of interaction of the central molecule with another mole¬ 
cule at r is £u(r). Then w(r) d£ is the work that must be done on the sys¬ 
tem, because of this one interaction, if £ is increased by d£. Thus the work 
done on the system when £ is increased by d£, because of interactions be¬ 
tween the central molecule and all molecules between r and r + dr, is 

u(r) d£ • pg(r; £) • 4 vr 2 dr. 

The total work jtt" is just the integral of this expression over r and over 
£ from £ = 0 to £ =1. Therefore 

p = kT In A 3 + kT In p + kT In 7(p, T), (17-15) 

kTlny = pJ j°° u(r)g(r, p, T; £)4irr 2 dr d£. (17-16) 

This is our final expression for the chemical potential. 

From the discussion following Eq. (17-4), we expect that 

lim g(r, p, T; £) = e - tu(r)/tr . (17-17) 

p—*o 

If we substitute (17-17) for g in Eq. (17-16), and integrate over £, we ob¬ 
tain a result that agrees with Eqs. (15-20) and (15-24). 

17-3 Integral equation for g(r; £). In the preceding section we have 
derived equations relating thermodynamic functions of a fluid to the 
radial distribution function. In this section we obtain an integral equation 
that determines g(r; £) theoretically. The argument we use is intuitive 
in nature, but correct.* A rigorous argument requires a detailed discussion 
beyond the scope of this book (see S. M., Chapter 6). 

Consider a central molecule whose position, without loss of generality, 
can be regarded as fixed at r a (Fig. 17-4). We define the local chemical 



* T. L. Hill, J. Chem. Phys. 30, 1521 (1959); J. Phys. Chem. 61, 548 (1957). 
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potential p(ri 2 ) at rj, a distance r i2 from r 2 , as the work against inter- 
molecular forces necessary to “charge up* or “couple* a new molecule at 
r t plus kT In A 3 plus kT times the logarithm of the local number density 
at Tj. This is by analogy with Eqs. (17-12) and (17-13). Clearly, when 
ri 2 —* oo, the molecule at r 2 has no influence on the coupling process at 
r 1( and p( oo) = p. For convenience we rewrite Eq. (17-15) as 

H = ju(oo) = kTlnA 3 + kT In p 

+ P f 1 f u(r 13 )g(r 13 ] {) dr 8 d{. (17-18) 

JO JV 

That is, we use dx 3 to represent an dement of volume in the neighborhood 
of Tj. 

Because the system is at equilibrium, we assume that the local chemical 
potential /i(r 12 ), defined above, has a constant value everywhere. This is 
an extension to the molecular level of the thermodynamic principle of 
constancy of the chemical potential in phase equilibria. That is, /t(ri 2 ) = 
p(oo) = p. This is the intuitive step in the present argument, which, 
however, as indicated above, can be justified. 

Next, we write down an explicit expression for p(ri 2 ). First we note 
that the mean number density at r t is pg{r\ 2 ), which we use in place of p 
in the concentration term, (17-12). To calculate the work against inter- 
molecular forces in coupling a new molecule at r t , we have to introduce a 
new, higher-order, distribution function 0 l 3 l (r 1( r 8 , r 3 ; {). This is defined 
by stating that the mean number density at r 3 when a molecule is fixed 
at r 2 and when a partially coupled (£) molecule is fixed at ri is pg t3] . 
(The significance of the superscript is that three molecules are involved; 
the radial distribution function is often written with a superscript two.) 
The potential energy of interaction of the molecule at t x with those in 
dt 3 is, then, (u(ri 3 ) • pg m dr 3 . The corresponding work required to in¬ 
crease { by di is u(ri 3 ) pg l3] dr 3 d£. Also, the work involved in the inter¬ 
action with the molecule at r 2 is u(r t2 ) d£. On integrating over r 3 and f, 
we obtain for /*(ri 2 ), 

p(Xia) = kTln A 3 + kT\npg(ri 2 ) + «(fi 2 ) 

+ p[ 1 f u(r l 3 )g m dr 3 d{. (17-19) 

Jo JV 

We now equate m(oo) in Eq. (17-18) with /<(ri 2 ), and deduce 
—krinff(r 12 ) = «(ri 2 ) + p £ J y «(ri 3 )[ff I 8 '(ri, r 2 , r 8 ; {) 

- g(r 13 ;S)]dr 3 dt. (17-20) 
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This result is not quite general enough, for it pertains to g(ri 2 ; 1) but 
not to 0 (r ]2 ; £). To remedy this shortcoming, we use the same type of 
argument as above, but add a molecule at ri finally coupled only to the 
extent £ instead of £ = 1. This resembles adding to a fluid of one species 
(£ = 1) a molecule of a second species (£). Therefore we digress to briefly 
consider a binary solution. 

Let the species be a and 0, and let the number densities be p« and pp. 
Consider two infinitesimal elements of volume, dr a and dtp. The prob¬ 
ability that an a molecule is in dr a is p a dr a (since dt a —> 0, this is also the 
mean number of a molecules in dt a ). If an a molecule is in di„, let the prob¬ 
ability that a 0 molecule is in dip be ppg a p(r i 2 , p„, pp, T) dip. Then the 
probability that an a molecule is in dt a and a 0 molecule is in dtp is the 
product of the two probabilities, PaPpQ a p di a dip. This result has to be 
symmetrical with respect to a and 0; so g„p is not only, as above, the 
radial distribution function for 0 molecules about a central a molecule, 
but is also the radial distribution function for a molecules about a central 
0 molecule. 

Suppose we have a solution in which pp is very small, p* —► 0, so that 
in effect each 0 has an environment made up entirely of a molecules. 
Now if we add one more 0 molecule to the solution, by the argument lead¬ 
ing to Eq. (17-15) [see also Eqs. (18-17) through (18-19)], 

Up = kT In + kT In pp 

+ Paf l f u«p{r)g a p{r, p», T; £)47rr 2 dr d£, (17-21) 

Jo Jo 

where £ refers to the 0 molecule. The last term is very closely related and 
leads immediately to an expression for the Henry’s law constant of the 
solute 0 in the solvent a (Problem 19-14). 

We return to our generalization of Eq. (17-20). Consider a hypothetical 
“binary solution” containing a fluid of number density p (“solvent”) and 
a very few (“solute”) molecules of the same type but with coupling param¬ 
eter £. Let the “solute” number density be P(, where pj —> 0. A “solvent” 
molecule is fixed at r 2 , and we add a new “solute” molecule to the system 
at tj. If rj is far from r 2 , r i2 —► oo, then, just as in Eq. (17-21), 

Pl(oo) = kT In A 8 + kT In p { + p £ J y u{ru)g{ri Z -, £’) dr 3 d£'. (17-22) 

But if r J2 is finite, 

Pt(r 12 ) = kT In A 3 + kT In p^(r 12 ; £) 

+ fu(ri2) + P a u(r u )g m (n d! 3 d£'. (17-23) 
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Then from m(ca) = m(r u), 

-kT In g(r 12 ; {) = Su(r l2 ) + p f f u(r n )\g [Z \Ti, r 2 , r 3 ; {') 

jo Jv 

- Q(nz,V)\dudi'. (17-24) 

This is our final result, a generalization of Eq. (17-20), first derived by 
Kirkwood (but not by this kind of argument). 

Equation (17-24) relates a two-particle (n = 2) distribution function, 
g, to a three-particle (n = 3) function, 0 131 . If we choose two fixed mole¬ 
cules instead of one (as above), an equation analogous to (17-24) can be 
derived relating an n = 3 function to an re = 4 function (Problem 
17-4). This procedure can be carried on indefinitely, giving us a hope¬ 
lessly complex set of N interlocking integral equations. We should, in 
fact, expect a very involved result of this kind, because the original con¬ 
figuration integral over 3 N coordinates (Eq. 6-22) seems impossible to 
handle analytically and exactly, and there is no reason to believe that 
any alternative approach (e.g., the distribution-function method) will 
provide a solution which is fundamentally any simpler. 

In view of this situation, Kirkwood suggested an intuitively attractive 
approximation, called the “superposition approximation,” which serves 
to “close” the set of integral equations. The approximation is to assume 
that the influences of the fixed molecules at r ( (with £) and r 2 on the mean 
density at r 3 are independent of each other. Analytically, the approxi¬ 
mation is 

0 l 31 (ri, r 2 , r 3 ; £) = g(r 13 -, i)g(r 23 )- (17-25) 

If we put this in Eq. (17-24), we have 
— kT In g(r 12 ; {) = %u(r 12 ) 

+ P f [ «(r 13 ) 0 (r 18 ; - 1] dr 3 d?. (17-26) 

Jo Jv 

This is an integral equation in g(r; £), called the Kirkwood equation. It 
has been solved numerically for a fluid of hard spheres and for the Lennard- 
Jones potential (see S. M., Chapter 6), and thermodynamic functions 
have been calculated from the solutions. An alternative but essentially 
equivalent integral equation, due to Born, Green and Yvon, has also been 
solved numerically. The results in both cases are qualitatively com¬ 
pletely satisfactory, but only moderately accurate quantitatively, as 
should be expected since the superposition approximation is used. 

When applied to an imperfect gas, the superposition approximation can 
be shown (Problem 17-5) to be exact for B% but not B 4 . 
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17-4 Formal definition of distribution functions. In this section we 
show how distribution functions are related to the configuration integral. 
In a .more exhaustive discussion (e.g., in S. M., Chapter 6), we would use 
these relations as a starting point. 

Let us refer back to the paragraph preceding Eq. (17-21), and apply 
the argument to a one-component fluid. Then we have that 


pV 2 ) (*i. fa) dti dr 2 


is the probability that any one molecule of the fluid will be found in dr x 
and any second molecule in dr 2 , where g i2) is the radial distribution func¬ 
tion (the same as g above). To be more general, let us define p <n) (r 1( ..., r„) 
by the statement that p n g M dr x ... dt n is the probability that any one 
molecule will be in dri, another in dr 2 , etc. We now relate this probability 
to the configuration integral. 

As in Eq. (6-28), 

e~ HlkT fig* <ip x 
/ e~ HlkT dq z dp x 

is the probability that the system will be found in the classical state 
dqjipx. We integrate over the momenta and find that 

e- ulkT dr 1 ...dr N 

Z 


is the probability that molecule 1 is in dr j, ..., and that molecule N is 
in dr at. The probability that molecule 1 is in dr], molecule 2 in dr 2 , ..., 
and molecule n in dr„, irrespective of the configuration of the remaining 
N — n molecules, is then 

dri... dr„ f • • • f e~ ulkT dr n+ i... dt N 
- J - v -(17-27) 


Now, the probability that any molecule is in dri, any second molecule 
in dr 2 , ..., and any nth molecule in dr„, irrespective of the positions of 
the other molecules, is just (17-27) multiplied by N\/(N — n)l, since 
any one of N molecules can be in dt\, any one of N — 1 in dr 2 ,..., and 
any one of N — n + 1 in dr„. This probability is just what we called 
pV n> dri... dr„, above. So we have 


pV n) (ri, . • •, r„) = 


N\ 


(N - »)! 


/ J 

f-UIkT 

e dr„ + i. 

• • dr at 

J V J 




(17-28) 
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For many purposes, when n is a small number, we can use 



(17-29) 


Note that g M treats all the molecules in the set of n molecules equiva¬ 
lently. On the other hand, 0 131 and g l4 \ introduced elsewhere in this chap¬ 
ter [see Eq. (17-19) and Problems 17-4 and 17-12], single out one molecule 
for special consideration. 

Let us define a quantity w <n) by the equation 


ff <»> = "/w 


(17-30) 


The physical significance of w M can be seen as follows. We substitute 
(17-30) for g (n> in Eq. (17-28), take the logarithm of both sides of the re¬ 
sulting equation, and then differentiate with respect to the position of 
one of the n molecules 1,..., n, say molecule t. This gives 


—Vjto <n) = 


/, 

•J 

fe vlkT (—ViU) dr n+ i ...dr N 

J 

f* 

r J 

(e~ u,kT dr n+1 . ..dr N 


(17-31) 


Now — V,(7 is the force, /,-, acting on molecule i (for any given configura¬ 
tion ri, . . . , tsr). Therefore the right-hand side of Eq. (17-31) is the 
mean force acting on i, averaged over all configurations of molecules 
» + 1,..., N (the set of molecules 1,2,..., n being in a fixed configura¬ 
tion t u ..., r»). Thus, 

fi n) = —V,-w' n) . (17-32) 

This tells us that w> <n) is the potential whose negative gradient gives the 
mean force acting on any one of the molecules 1, ..., n. Therefore w (n) 
is called the potential of mean force. 

As a specific example, suppose we start with two molecules far apart 
in a fluid with given N, V, T, so that g (2> = 1 and w <2) = 0. We then 
move the two molecules together reversibly in the thermodynamic sense 
(i.e., extremely slowly) to within a distance r. The force against which 
work has to be done in this process is / (2) , and the work done on the sys¬ 
tem is w i 2 ) (r). In the limit as p —* 0, the two molecules are in effect in a 
vacuum, and w <2) (r) is just u(r). But at a finite density, the work w {2) (r) 
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is influenced by the presence of other molecules via statistical averaging 
(Eq. 17-31). In fact, Eq. (17-20) gives a general expression for t» < 2 > (ri 2 ), 
since —kT In g = w (2) . 

To extend the idea of distribution functions to open systems, we start 
with the probability 

jyj dti ... dr, 

___--- 

for a closed system with N molecules that any molecule is in dr 1( . . . , 
and any other one in dr n . (The subscript N is inserted here and below to 
indicate a closed system with N molecules; we have omitted the sub¬ 
script N until now in this chapter to simplify the notation.) This is the 
probability referred to preceding Eq. (17-28). In the notation of Chapter 
15, the probability that an open system contains N molecules is 


/ V fe- UNlkT dr n+1 ...dr lv 


_ ^ N Qv _ z N Zn 
S N \S ' 


(17-34) 


We therefore multiply (17-33) by (17-34) and sum over N to get the prob¬ 
ability that molecules are in dr 1( ..., dr„, irrespective of the locations of 
other molecules, in an open system (i.e., irrespective of N). We define 
0 <n> in an open system by equating this probability to pV n) dti . . . dt n , 
where p = J7/F. Thus we have that 


P V n> (ri,...,r») 


~s]? n (N-n)\f'vf e 


u » lkT dr n+l ...dr N . 

(17-35) 

Here the physical significance of g ln) is the same as for a closed system 
with N = 77. The two g M ’s differ only by a term of order 1/77. Although 
the open system g (n) appears to be more complicated, it has a number of 
advantages in theoretical work. 

We can define a io (n) for an open system by the relation (17-30) and 
again show (Problem 17-6) that w (n) is a potential of mean force. (This 
time the force is averaged not only over all configurations of molecules 
n + 1, ..., N but also over the number of molecules N.) The physical 
significance of w (n> is the same as ’ for a closed system. In the example 
above [preceding Eq. (17-33)], to i2 \r) is the reversible work done on an 
open system (p, V, T fixed) in bringing together two molecules from 
r — oo to r. 


17-5 Surface tension. In this section we illustrate the use of distribu¬ 
tion functions in an inhomogeneous region. The system we choose is a 
plane interface between a one-component gas and liquid. Our object is 
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/* = +l 

/ 



to relate the surface tension to g {2) . We use an argument, due to Kirkwood 
and Buff,* which is a generalization of that used for the pressure in Eq. 
(17-10). 

Consider a hypothetical macroscopic strip of zero thickness, width s, 
and height 21 , immersed in the fluid and extending from bulk liquid to 
bulk gas through and perpendicular to the interface (Fig. 17-5). The 
xy-plane is parallel to the interface and located in the transition region 
(the exact location is immaterial). We include in the thermodynamic 
system here only that part of the fluid in the area 8 X 21 and to the left 
of the strip (x < 0). Now imagine the strip moved parallel to the surface 
through an infinitesimal distance dx. Both the volume and interfacial 
area of the system change, so the work done by the system is 

pdV — 7 da = p • 2 U dx — 7 •« dx, (17-36) 

where 7 is the surface tension (not an activity coefficient) and a the sur¬ 
face area. 

The mean number density in this system is a function of z, p(z) (z is 
not to be confused with the activity). We define p'(z) as the force per unit 


* J. G. Kirkwood and F. P. Buff, J. Chem. Phya. 17, 338 (1949). 
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area which those molecules in the area s X dz at z and to the left of the 
strip exert on all molecules to the right of the strip. The total force ex¬ 
erted by the molecules in the system (x < 0) on those to the right of the 
strip is then Jl} p'(«)s dz. If we multiply this force by the displacement 
dx when the strip is moved, we get another expression for the work in 
(17-36). Equating these two expressions, we have 

p+i e-\-l 

sdx J t p'{z) dz = sdx J i pdz — ysdx. 

Therefore, 

y = f_ t lp-l/(z)]dz. (17-37) 

The limits can be replaced by ± oo because p'(z) —» p in either bulk 
phase, by definition. 

Thermodynamically, since 7 exists only when two phases are in equilib¬ 
rium, 7 has to be a function of only one intensive variable, say T. In Eq. 
(17-37), p (the vapor pressure) is a function of T, and p' a function of z 
and T. Also, p is a function of z and T. In the gas phase p(z, T) —> pa(T), 
and in the liquid p(z, T) —» Pl(T), where pa and pi are the bulk phase 
values. 

The remaining part of the problem is to express p'iz) in terms of g i2) 
(which we denote by g hereafter). Consider a molecule at the point P (r>), 
x u 2/i = 0, z i, and another at P' (r 2 ), x 2 , 2/2, *2, as in Fig. 17-5, where 
®i < 0 and x 2 > 0. The normal (i.e., x-component of the) force exerted 
by the molecule at P on the molecule at P' is — (x 2 — Xi)u'(r)/r, where 
r = |r 2 — rj|. The mean number density at r 2 when a molecule is fixed 
at Tj is p(z 2 )g(zi, z 2 , r), which defines the distribution function g. Since 
g depends on zj and z 2 as well as on r, this pair distribution function is not 
a “radial” distribution function. At large separations r, g —» 1; that is, 
g(z u z 2 , 00) = 1. The mean number of molecules in dr 2 , when a molecule 
is fixed at r 1( is p{z 2 )g(zi, z 2 , r) di 2 ) and the normal force exerted on these 
molecules by the molecule at ri is 

- — — * —— • pi?z)g{zu «2, r) dr 2 . 

The normal force exerted on all molecules with x 2 > 0 is then 

~ T f f ( X2 r Zl ) z *> r ) dx * d y* dz *< (17-38) 

—00 —00 0 

which we denote by —G(x 1 , zi). Finally, the normal force exerted on all 
molecules with x 2 > 0 by all molecules in the area s dz x at z t and to the 
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left of the strip is 

—sdzi JG(x u zi)p(ei)dx l . 

To obtain the contribution of intermolecular forces to p'(*i) (a force per 
unit area), we divide this last result by the area s dz\. In addition, just as 
in Eq. (17-10), there is a contribution p{z\)kT owing to momentum 
transport. Therefore 

P'M = p(*i)kT - p(zi) f° G(x u z{) dx h (17-39) 
where — G is given by (17-38). 

We change variables from r 2 to ri 2 , that is, from x 2 , y 2 , z 2 to 
X \ 2 — x 2 — xi, Via = y 2 , «i 2 = «2 — *i- Of course, r 2 = x, 2 + 
Viz + 2 i 2 - Then 

-J -00 + 0 ° +o° 

G(x u zi) = f J j (^r)u'(r)p(z l2 + zi)g(zi, z l2 , r) dx i2 dy i2 dz l2 . 

—« —00 —*1 

Next, we perform an integration by parts on the integral in Eq. (17-39). 
The integrand in the resulting integral over is an even function of xj. 
For symmetry, then, we extend the integration from — oo to +oo, multi¬ 
ply by 1/2, and also replace Xj by Xi 2 as the dummy variable of integration. 
This gives the final result 

p'(*i) = pMkT 

- ip(* 1 ) Jjf (^jr) P(* 12 + *i)?(*i, *i 2 , r) dx 12 dy 12 dz 12 , (17-40) 

—00 

which can be put into Eq. (17-37) for 7. 

It is not difficult to see (Problem 17-7) that Eq. (17-40) reduces to 
Eq. (17-10), as it should, in either bulk phase. 

In applying Eq. (17-40), we encounter essentially the same difficulty 
as in the theory of homogeneous fluids (Section 17-3). That is, the “singlet” 
distribution function p(z) can be related to the pair function p (2) (or g) by 
an integral equation (Problem 17-8), the pair function to a triplet function 
by another integral equation, etc. Exact solution of this set of equations 
to obtain p(zi) and g(z u z 12 , r) for use, for example, in Eq. (17-40) is 
impossible. 

Numerical calculations have been made, however, on two approxima¬ 
tions: (1) Fowler’s approximation,* in which an obviously unrealistic 
sharp discontinuity is assumed in the density at the interface (Problem 


* Kirkwood and Buff, loe. at. 
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-*/r*-► 

(a) 



-z/r* -- 

(b) 


Fia. 17-6. Nature of gas-liquid interface based on approximate model and 
the intermolecular potential (16-2). (a) Density as a function of z, at 

different temperatures, passing through the transition region. The maximum 
density is 8 = 1. (b) The quantity p — p', as a function of z. The surface 
tension (a function of temperature) is proportional to the area under the p — p' 
curves (Eq. 17-37). In both (a) and (b), the location of « = 0 is arbitrary for 
each curve. 


17-9); and (2) Hill’s approximation,* which presumably gives a correct 
qualitative picture of the transition region (Fig. 17-6) but which is based, 
nevertheless, on a very crude model. Approximation (1), together with an 
experimental radial distribution function for bulk liquid argon, leads to a 
value of 7 at 90°K for argon that differs from the experimental value 
(11.9 ergs-cm -2 ) by 25%; while approximation (2) gives a value of 7 
that is off by a factor of two. 


* T. L. Hill, J. Chem. Phys. 20,141 (1952). 
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Pboblems 

17-1. Choose u = /" and dv = hdh, integrate (17-6) by parts, and verify 
(17-8). (Page 305.) 

17-2. Deduce pv in Eq. (17-9) from Eqs. (17-7) and (17-8), using two 
integrations by parts: (a) choose dv — x 2 dx with the first integral in (17-8), 
and (b) choose dv *= dx with the second integral in (17-8). (Page 305.) 
17-3. At low densities, p —» 0, we can put, in Eq. (17-20), 

<?«) - 

Show, then, that gi in the expansion (17-3) is given by 

gi(r 12 , T) = e - ulr '* ,kT j[e~ u ^ ,kT - - i] dr 3 . (17-41) 

(Page 306.) 

17-4. Extend the argument leading to Eq. (17-24) to derive an integral 
equation for g 131 in terms of g l * 1, where pff [41 (ri, ..., 14 ) {) is the mean number 
density at when molecules are fixed at r 2 and 12 and a partially coupled (£) 
molecule is fixed at ri. (Page 311.) 

17-5. Prove that the superposition approximation gives the correct third 
virial coefficient. (Page 311.) 

17-6. Show that w M is a potential of mean force in an open system. (Page 314.) 
17-7. Show that p'(zi) in Eq. (17-40) reduces to p in Eq. (17-10) in a bulk 
phase. (Page 317.) 

17-8. Use the method of Eq. (17-15) to derive the following integral equation 
for the surface-tension problem (assume the local chemical potential has the 
same value at every point): 

trinp(zi) = kT In p„ 

+ J o f r uW(p«ff«(r; {) — p(«i 2 + «i)p(zi, * 12 , r; $)] *12 d{, (17-42) 

where a refers to either bulk phase. (Page 317.) 

17-9. Imagine that a sample of bulk liquid is divided into two parts by a 
plane of area Ct, and that the two parts are separated reversibly by gradually 



Figure 17-7 
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increasing d (Fig. 17-7) from d = 0 to d = °o. Assume that the (vapor) 
space between the two liquid parts has essentially zero density, and that each 
liquid part remains homogeneous right up to the vapor region, where there is 
a sharp discontinuity in density (from the liquid density p to p — 0). If a 
molecule is fixed at P (Fig. 17-7), the number density at P' is assumed to be 
pg(r), where g(r) is the bulk liquid radial distribution function. We can now 
calculate the normal force between the molecule at P and the other half of the 
liquid [compare Eq. (17-7)], and then we can calculate the total work necessary 
to pull the two liquid parts away from each other. This work is set equal to 
2 y&, since the amount of new surface area formed is 2G. Derive, in this 
way, the equation 

y = ilr L r * *sr s(r ’ p> T) dr> (17-43) 

originally due to Fowler. This can also be shown to be a special case of 
Eqs. (17-37) and (17-40). (Page 318.) 

17-10. Show schematically what you would expect the functions 4jrr 2 p(r) and 
g(r) to look like for a monatomic solid with cubic close packing [p(r) here is aver¬ 
aged over all directions outward from the central atom]. The successive num¬ 
bers of neighbors are 12, 6, 24, ... at distances a, V2a, V3o, .... From these 
curves guess the qualitative form of g(r) for the liquid state of the same 
substance. 

17-11. Apply Eq. (17-21) to the ease where the “solvent” is a dilute gas a. 
Show that this equation is consistent with Eq. (15-47), 

Zfl/Pfl = 1 — 6iip„ + • • •. 

17-12. Devise a formal definition of g l3] analogous to Eq. (17-28) and find 
the relation between p l3] and j (3) . 

17-13. Investigate the possible application of the law of corresponding states 
to surface tension. 
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CHAPTER 18 

DILUTE ELECTROLYTE SOLUTIONS AND PLASMAS 

Since a great many accounts of the Debye-Huckel theory of electro¬ 
lytes (based on the linearized Poisson-Boltzmann equation) are available, 
in Section 18-1 we shall give only a rather short discussion of this topic. 
The reader who desires a more detailed treatment should consult Fowler 
and Guggenheim, Chapter 9. The rest of the chapter is devoted to a 
less conventional approach to the problem, based on Kirkwood’s theory 
of solutions.* The distribution-function methods of Section 17-3 are ex¬ 
tended in Section 18-2 to solutions in general and, in Section 18-3, to 
electrolyte solutions in particular. It is shown in Section 18-3 that the 
Debye-Huckel limiting law can be deduced from the Kirkwood solution 
theory as well as by the usual (Debye-Hiickel) method of Section 18-1. 


18-1 Debye-Huckel theory. The model we consider here and in Sec¬ 
tion 18-3 is the following. The volume V is completely filled with a dielec¬ 
tric continuum of dielectric constant e which, for simplicity, is not con¬ 
sidered a molecular species of the system. Immersed in the continuum 
(see Fig. 18-1) are “hard” monatomic ions of diameter a, N{ of species i. 
Thus the system is in effect a “gas" mixture of ions in a continuous dielec¬ 
tric medium. If the continuum is a vacuum, e = 1 and the system is a 
real gas mixture (i.e., a plasma). The charge on an ion of species i is 
z,«, where e is the charge on a proton. In view of our continuum assumption 
(see Fig. 18-1), the interionic potential energy for an ij pair is 


Uij(r) = +oo 

_ x«xye 2 
er 


r < a 
r > a. 


(18-1) 


Of course, real ions are not hard spheres, 
and a real solvent does not behave like 
a continuum when r is small (say, 
r < 5a). Therefore Eq. (18-1) is only 
an approximation. However, this equa- 



Fio. 18-1. Model for “hard” ions 
immersed in a dielectric continuum. 


* J. G. Kirkwood, J. Chem. Phys. 3, 300 (1935). 
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tion becomes exact, in effect, in the limit of very dilute solutions, for 
then only large values of r are significant. 

We denote the concentration (number density) of species i by p< = 
N{/V. In order for the solution to be electrically neutral, we must have 
HiPtZi = 0 (which we call the neutrality condition). If the solution 
is not neutral, the excess charge will collect on the surface, and the prop¬ 
erties of the system will depend on the shape of container. 

If one is interested in temperature and pressure effects on an electrolyte 
solution, the solvent must be treated explicitly as a molecular species 
(as in Fowler and Guggenheim, Chapter 9). In this case, however, not 
much real progress can be made without giving up Eq. (18-1) for small 
values of r (say r < 5a) in favor of a detailed study of the behavior of 
solvent molecules around a pair of ions. We avoid all such complications 
in our brief treatment by adopting the continuum model already described. 

The basic thermodynamic equation for our system is 

dE = TdS - pdV + Y l l l i dN <- (18-2) 

{ 

The pressure p is the pressure of the “gas” mixture of ions. In reality, 
of course, if the continuum is a solvent and not a vacuum, p is the 
osmotic pressure of the electrolyte (this will be confirmed in Sections 19-1 
and 19-2). In a plasma (e = 1), p is the total pressure. 

Our object in this section is to give the Debye-Hiickel derivation of 
the ionic activity coefficient for an arbitrary species * in a very dilute 
solution. In the limit as pj —* 0 (all j), we would have for the chemical 
potential (Eq. 4-23) of species i, 

Pi = kT In A? + kT In p< (all pj —► 0). (18-3) 

Of course, if the molecular nature of the solvent were being taken into 
account, A? would be multiplied by a quantity (a function of pressure 
and temperature) closely related to the Henry’s law constant and deter¬ 
mined by the nature of the interaction between one ion and pure solvent 
[see Eq. (17-21) and Chapter 19], We are interested in the first-order 
correction to Eq. (18-3), when the solution is not quite dilute enough 
for that equation to hold: 

Pi = kT In A? -f kT In p< + kT In 7,-. (18-4) 

Perhaps the first thought to come to mind is that we have here, in effect, 
a dilute gaseous mixture of ions and hence that the methods of Chapter 15 
on imperfect gases are applicable. Thus, all we would need for the first- 
order correction mentioned above would be to calculate second virial 
coefficients as in Section 15-3. However, if we substitute the coulombic 
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potential energy, (18-1), in Eq. (15-52), we see [on writing e~ ulkT = 
1 — ( u/kT )] that the integral diverges because of the upper limit. In fact, 
the integral will diverge not only for the coulombic potential r -1 but 
also for r~ 2 and r -3 . Convergence begins with r ~ 4 (in the van der Waals 
interaction, we have r -6 ). The same is true for the higher virial coeffi¬ 
cients. The physical reason for the divergence is that with forces of 
sufficiently long range, it is not possible to treat a dilute system in terms 
of, successively, binary interactions, ternary interactions, etc., as the 
concentration increases. Instead, many-body or “collective” interactions 
appear even at the first departure from the dilute solution behavior of 
Eq. (18-3). 

Thus, a virial expansion does not exist for systems made up of particles 
interacting by long-range forces (pair potential energy «r~ n , n < 3); 
other methods must be used. 

We now turn to the Debye-Huckel argument, which leads to exact 
limiting expressions (at very low concentrations) but which is otherwise 
approximate. We consider the spherically symmetrical (on the average) 
neighborhood around one particular (“central”) ion of type j (as it moves 
through the system; or we can regard it as fixed). Let the mean electro¬ 
static potential at a distance r from the j ion be ^,(r). The zero for ^y(r) 
is chosen at r = oo. Also, let the mean charge density at r be w,(r). 
Then Poisson’s equation in electrostatics states that 

VV/(r) = - (r > a). (18-5) 

Because of the presence of the central j ion, ^(r) will be nonzero for 
finite r [^y(r) will have the same sign as ary], and also n,(r) will be nonzero 
for finite r [»y(r) will have a sign opposite to that of z, because of a net 
accumulation of ions of opposite charge]. The region over which ^y(r) 
and ny(r) differ significantly from zero is called the ion atmosphere of the 
central j ion. As r —> co, n,(r) —» 0 because of the neutrality condition. 

The general plan is to convert Eq. (18-5) into a tractable differential 
equation in ^/(r), to solve the differential equation, and finally to use 
^y(r) thus found to calculate the activity coefficient 7/ by a charging process 
(as in Chapter 17). 

From Section 17-3, we have that the mean concentration of ions of 
type i a distance r from a central j ion is 

Pi(r) = Piff.j(r) = p<e~ Wi > lr)lkT , (18-6) 

where u>»y(r) is the potential of mean force. Since, with our model, no 
ion can be closer to another than r = o, p{(r) = 0 and w,,(r) = +oo for 
r < a. At r = oo, = 0, ga = 1, and the concentration of i ions is 
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the bulk concentration p,. The first approximation we make in the Debye- 
Hfickel theory is to assume that p{(r) is determined by a simple Boltzmann 
factor, 


pkr) = p < e-^ (r,/iT (r > a). (18-7) 


This is analogous to assuming that in an electrochemical equilibrium 
between two bulk phases a and 0 , with a potential difference ^ 


_ a-zMikT 

Pi = Pie , 


with neglect of activity coefficients. This analogy will be pursued more 
explicitly in Section 18-3. Comparison with Eq. (18-6) shows that the 
assumption (18-7) also amounts to setting u>«(r) = Xie^y(r) for r > a. 
Exact expressions for ioy(r) and z,«^y(r) will be compared in Section 18-3. 
Although Eq. (18-7) is in general not exact, it proves to be correct in 
the limit of very dilute solutions. 

Equation (18-7) makes it possible to write Eq. (18-5) as a differential 
equation in ^y(r) only. The charge density at r is 

»i(r) = e'^zipiir) (18-8) 

i 

= «£ziP,«-* < *' lr)/ * r (r > a). (18-9) 


Then substitution of Eq. (18-9) for n,(r) in Eq. (18-5) leads to the so- 
called Poisson-Boltzmann differential equation in \f>j(r). This equation is 
nonlinear and very difficult to solve in general (except by numerical or 
series methods). But much effort to solve the equation is probably not 
justified in any case, since Eq. (18-7) and therefore Eq. (18-9) are approxi¬ 
mations in the first place. 

To make further progress, we limit the discussion to cases in which 
zMj{r)/kT « 1 for the important values of r. This is the second Debye- 
Huckel approximation; it allows us to linearize the Poisson-Boltzmann 
equation. After solving the linearized equation, we can come back to see 
what is meant by “important values of r” and for what electrolyte con¬ 
centrations linearization is a good approximation. It will turn out that 
this approximation, like the first one, is justified if the solution is 
sufficiently dilute. 

Therefore we expand the exponential in Eq. (18-9) up to the linear 
term. The Poisson-Boltzmann equation becomes 
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or 

1 d 2 (ii/j) 2, , ^ ^ 

r dr 2 = * ** (r * a) ’ 

(18-10) 

where 

2 4ire 2 v - ' 2 

(18-11) 


The leading term in the expansion does not contribute to Eq. (18-10) 
because of the neutrality condition. The summation in Eq. (18-11) is 
twice the “ionic strength, * often used in thermodynamics (and expressed 
in moles-liter -1 ). Thus (ionic strength) 1/2 and k —> 0 as the solution 
approaches infinite dilution. When we rewrite Eq. (18-10) in the familiar 
form 


d 2 (r^j) 

dr 2 


= k 2 Wo), 


the solution is obviously 

rfj = C x e~" + C 2 e+* r . 

Since we know that -* 0 as r —> oo, we have to choose C 2 — 0. 
Therefore 

Ur) = (r > a). (18-12) 


To evaluate C i we must consider the situation inside the sphere r — a 
(Fig. 18-1), where no charges can be present except the charge z,e at 
r = 0. Laplace’s equation is therefore satisfied inside the sphere. Let 
<pj be the inside electrostatic potential. Then 




(r < a). 


Successive integrations give 


and 


d(r-Pi) = c 
dr 3 


Ur) = C s + ^ (r < a). 


(18-13) 


When r —* 0, <pj must approach the coulombic potential zjt/er. Therefore 
C* = zfl/t. To evaluate the remaining two constants, C\ and Cz, we use 
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the two boundary conditions at r = o: 

tM) = <f}(«■), 

#(«) = Vjfa)- 

Equations (18-12) through (18-14) give 


Ci = 


zyee 


«(1 + na) 


Cz= ~ 


zjcK 


«(1 + ICd) 


Therefore 


*' (r) = fe ) (r * “>• 


—«(r— a) 


(18-14) 


(18-16) 


We also note that, in Eq. (18-13), CJr is the contribution to the potential 
inside the sphere owing to the central j ion itself, and hence C 8 must be 
the contribution of all the ions outside the sphere (i.e., the contribution 
of the ion atmosphere). We denote the latter potential by This 

quantity is a constant. Hence, at r = 0, where the central charge zy* 
is located, we have 

- c >- - w+zry <1Me> 

We can now use Eq. (18-16) to calculate the electrostatic contribution 
to 7y. The connection is established by extending the argument of 
Eqs. (17-11) through (17-14) to solutions [in fact, we have already applied 
it to a binary solution in Eq. (17-21)]. We have 


My == My + My', 


(18-17) 


M* = (Uf) = kT In A® + kT In py, (18-18) 

\9Nj/tf a!dj ,v,T 

= kT In 1 j, (18-19) 

r.V.T 

where kT In Yy is the isothermal reversible work, against intermolecular 
forces, required to add one more molecule of species j to the system. 
For simplicity, we calculate the electrostatic contribution to kT In 7y 
using the work of a charging process, and then add on separately the 
hard-sphere interaction contribution, since the latter is already available 
from Chapter 15. To compute the electrostatic work of adding one more 
j ion, we introduce the charging (coupling) parameter £y on the j ion. 
For an arbitrary value of (y, the j ion has a charge {yrye, and hence 



*f(0;{y) = - 


_jyzy?i_. 

e(l + tea,) 
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If we bring in an increment of charge zjo d(j to the j ion, the work done 
against the ion atmosphere (i.e., against the electrostatic contribution to 
the intermolecular forces) is ♦£ tm (0; Therefore, 

r*—f' 8 - 20 * 

is the electrostatic contribution to kT\n 7,-. 

For the hard-sphere contribution to kT In 7 j (which would remain even 
if we put each zj = 0) in a dilute solution, we merely have to extend 
Eq. (15-47) for a binary gas mixture to an arbitrary number of compo¬ 
nents. In the binary case [and in the notation of Eq. (15-47)] we have 
from Eqs. (15-24), (15-25), and (15-52), 

In 7 2 = Bjipi + 2 B 02 P 2 + • • • 

= —7j— (fii + P 2 ) 4— •• 

For any number of components, then, we obviously have 

lnl= (allj). (18-21) 

Finally, then, for the electrolyte solution, 

ln y ’ = ~ 2tkT{\ + m) + ~T~ £ P,% (18-22) 

This is a statistical-mechanical equation for the activity coefficient of a 
single ionic species. Of course, in thermodynamics we can measure activity 
coefficients only for neutral salts, so only “neutral” combinations of 7/s 
from Eq. (18-22) (e.g., 7 + 7_, 7 ++ 7i, etc.) can be checked experimentally. 
It is well known that excellent agreement is achieved between experi¬ 
mental activity coefficients and the limiting form of Eq. (18-22) at high 
dilutions, but not always with Eq. (18-22) itself. At high dilutions, we 
have the so-called Debye-Hiickel limiting law, 

ln 7>--*-fS ( 18 ~ 23 > 

We note that ln 7/ is proportional to the square of the charge on a j ion 
and to the square root of ionic strength. 

Equation (18-22) has the disadvantage of reflecting the approximate 
nature of our choice of u#(r) in (18-1). It is therefore not surprising that 
Eq. (18-22) is not always in satisfactory agreement with experiment. 
On the other hand, Eq. (18-23) does not involve a. In fact, it is rather 
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obvious on physical grounds that Eq. (18-23) must depend on the validity 
of Coulomb’s law, «y(r) = za^/er, only for r » a, because in a very 
dilute solution the ions are in general far apart. But we may have con¬ 
fidence that Coulomb’s law (with e) will become exact for large r; there¬ 
fore we expect Eq. (18-23) to be an exact limiting law provided that 
our approximations, (a) tcy(r) —+ r<e^,(r), and (b) Zi»pj(r)/kT <K 1, become 
valid in a very dilute solution. The fact that the Debye-HQckel limiting 
law is verified experimentally indicates that this is indeed the case. On 
the theoretical side, we shall confirm (b) below and (a) in Section 18-3. 

We can examine more carefully the point above about the importance 
of r a for dilute solutions by investigating the (net) charge density 
distribution (the “ion atmosphere”) about the central j ion. From 
Eqs. (18-10) and (18-15), 


n/r) 


ck% = _ zjeK 2 e ~* (r ~~ <,) 
47r 4irr(l + xa) 


(18-24) 


That is, n,(r) « e~"/r. Note that n } (r) has a sign opposite to Zj, as 
expected. The net amount of charge between r and r + dr is proportional 
to r 2 rij(r), or re~ Kr /(r). The function x/(r) is plotted in Fig. 18-2. 
This figure shows the form and extent of the ion atmosphere. In par- 



nr 

Fio. 18-2. Relative net amount of charge between r and r + dr in ion at¬ 
mosphere around a particular ion in a dilute electrolyte solution. 
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ticular, the maximum in /(r) occurs at r = 1/k, as is easy to verify 
(Problem 18-1). Thus the most important region of the ion atmosphere 
is in the neighborhood of r = 1 /k. This region moves to larger values of 
r as the solution becomes more dilute (k —> 0). This confirms the fact 
that the behavior of «,y near r = a is unimportant for the limiting law. 

To obtain some feeling for the orders of magnitude involved, we can 
calculate (Problem 18-2) that in an aqueous 1-1 electrolyte solution at 
25°C (e = 78.5), 

1=^2 ( inA )> (18-25) 


where c is the concentration of salt in moles-liter -1 . Thus, if c = 
0.01 moles-1 -1 , 1/k — 30.4 A. By comparison, a is of order 3 or 4 A, 
so that 1/k = 0(10a) or Ka = 0(1O -1 ) at c = 0.01 moles-1 -1 . At about 
this dilution, then, we should expect the limiting law, (18-23), to begin 
to be rather accurate [again assuming approximations (a) and (b) above 
to be valid], and this is found to be the case. If c = 10~ 4 moles-1 -1 , 
1/k = 304 A, etc. 

At this point, we are in a position to go back to the question of the 
conditions under which the linear approximation, (b), made in writing 
Eq. (18-10), is legitimate. We require that 


kT 


« 1 . 


As we have seen, the important values of r are those in the neighborhood 
of r = 1/k. Therefore we put r = 1/k in Eq. (18-15), and obtain (drop¬ 
ping factors of order unity) 


• 2 k 

ekT 


« 1 


as the necessary condition for the validity of the approximation. This 
is the same as 


! » —. 
k * ekT 


(18-26) 


For water at 25°C, this gives the condition 1/k » 7.1 A (Problem 18-3) 
(that is, 1/k » 2o). Hence we come to the conclusion that for e < 
0.01 moles-1 - ’ (1-1 electrolyte), where 1/k > 30 A, the linear approxi¬ 
mation should begin to be a good approximation. For c < 10 -3 or 10 -4 
moles-1 -1 , it should be an excellent approximation. 

The Debye-Hiickel argument itself cannot provide any theoretical 
justification for approximation (a) in a dilute solution [w,->(r) —* z^jir) 
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as k —* 0], as it does for approximation (b) [see (18-26)]. However, as 
already mentioned, there is indirect experimental evidence supporting both 
(a) and (b). We shall return to (a) in Section 18-3. But let us note here 
what the Debye-Hiickel theory has to say about wa(r). From Eq. (18-15) 
we have 

w.j(r) = +oo (r < a) 

2 —«c(r—a) 

“ " *%•+«.) <r a •>• (‘W 

In very dilute solutions, *ca —> 0 and 

t Vij(r) _> Z —(18-28) 


As implied by earlier remarks, Eq. (18-28) proves to be exact, as a 
limiting law, but Eq. (18-27) is an approximation. 

Incidentally, since by definition (see g„s in Section 17-3) w,-,(r) = wa(r), 
we can deduce the symmetry condition zdj{r) = zrfi(. r ) from approxi¬ 
mation (a), Wij = Z(Vpj, which was used in arriving at the Poisson- 
Boltzmann equation, (18-5) and (18-9). This symmetry condition is 
seen to be satisfied by Eq. (18-15), which is a solution of the linearized 
Poisson-Boltzmann equation. But when the complete (nonlinear) Poisson- 
Boltzmann equation is used, it turns out that the solution of this equation 
does not satisfy the symmetry condition in general. Thus we can be sure 
that approximation (a) is incorrect except when the electrolyte solution 
is dilute enough (Eq. 18-26) to justify linearizing the exponential in 
Eq. (18-7). 

Finally, we turn to a consideration of thermodynamic functions (Prob¬ 
lem 18-4) derivable from the Debye-HQckel limiting law, (18-23). From 
Eqs. (18-2), (18-4), and (18-23), we have 


where 


£ N m =Y,N i \n Afp,- + §, (18-29) 


Pel e 2 «C V-* 2 _ 1(3 V 

kT~ 2ekT ai ~ Sir ‘ 


(18-30) 


We use F e i to represent the electrostatic contribution to F. To find the 
(osmotic) pressure, we employ Eq. (18-29) and the thermodynamic 
relationship, 
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We multiply this equation by dV/V, integrate from V to so, and obtain 


kT ~~ *-> Pi 24t 


24ir 


(18-31) 


The Helmholtz free energy is then 



where 

A e i _ e 2 x y' \r 2 _ _ * 3 V 

kT ~ 3ekT iZi ~ 12ir 


(18-32) 


(18-33) 


As a check on Eq. (18-33), we can make use of a different charging 
process, one in which we start with all ions discharged and then charge 
all ions at the same rate up to the final charge. Thus there is a single 
charging parameter £. In this process, if carried out reversibly with 7’, V, 
and the IV,- held constant, 

A A = A„i(* - 1) - A„(« = 0) = Aei(£ = 1) = W eI , 


where W«i is the work done on the system in the charging process. When 
all ions are charged to the degree £, we have from Eq. (18-16) that 

.atm/ a. ft _ £*>**(£) 

Vi ( 0 , £) =--— 

_ _ t 

e 


since ic(£) = £/c according to Eq. (18-11). Then the work necessary to 
bring up an additional charge z/e d£ to one j ion is * > * tm (0; f)zy*d£. Hence 
the total work necessary to charge up all the ions is 



in agreement with Eq. (18-33). 


18-2 Kirkwood theory of solutions. In this section we extend the 
methods of Section 17-3 to solutions of monatomic molecules. As in that 
section, we use [in Eq. (18-37)] an intuitive argument (not employed by 
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Kirkwood*) to avoid involvement in the detailed formal properties of 
distribution functions. The results we find here will then be applied in 
the next section to electrolyte solutions in particular. 

We begin with Eqs. (18-17) through (18-19), which apply to any solu¬ 
tion of monatomic molecules: 

to = kT In A? + kT In p< + kT In 7,-, (18-34) 

where kT In 7 i is the work Wi, against intermolecular forces, required 
to add one i molecule to the system. We now derive an equation for Wi. 
The average number of molecules of species s in the element of volume 
dr, in the neighborhood of a partially coupled molecule of species i is 

p,e~ u>u{ru,U)lkT dr„ (18-35) 

where w,-, is the potential of mean force. The contribution to Wi, arising 
from interactions with these molecules, when {,• is increased by d&, is 
(18-35) multiplied by u,-,(r,-,) d{,-. Therefore, if we integrate over r, 
and and sum over s, we have 

Wi = kT In 7, = £ P./J j y «,yr w< ‘ < * <)/i7 ’dr. df,. (18-36) 

This is the generalization of Eq. (17-16) for solutions. 

Consider an element of volume dr,- near a fixed molecule of species j 
and the equilibrium between i molecules in du and i molecules at a very 
large distance from the fixed j molecule. We can write 

oo) = kT In A? + kT In p< + kT In 7< 

= P,(r„) = kTlntf + kT In pfr*) + W'(r«), (18-37) 

where is the mean concentration of i molecules in dr,-, and W( repre¬ 
sents the work that has to be done against intermolecular forces in order 
to add an i molecule to the system at T t . 

The average number of s molecules in dr„ when a j molecule is fixed 
at Tj and a partially coupled i molecule is at r,-, is 

dr„ 

where «>*,• is a potential of mean force defined by 

ff t3I ( r «, tj, r,; {,) = e-^ lkT , (18-38) 

* J. G. Kirkwood, J. Chem. Phys. 3, 300 (1935). 
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in the notation of Section 17-3. The physical significance of to,*- is the 
following: it is the reversible work which must be done to move an s mole¬ 
cule from infinity through the solution up to the position r„ when a par¬ 
tially coupled ({<) i molecule is fixed at r< and a j molecule is fixed at ry 
during the process. In the superposition approximation, (17-25), we 
have 

«>*;({,') = to,-,({<) + to,-., (18-39) 

but this is not true in general, because the effects of the molecular en¬ 
vironments of the i and j molecules on the s molecule are not simply addi¬ 
tive but perturb each other. In the limit as p,- —► 0 (all t), that is, a 
very dilute gas, 

»*,({<) -» {,t u. + Uj„ 

and (18-39) is correct (if the intermolecular potential energy is itself 
pairwise additive). 

The total work of coupling i at r,- is then 

Wfati) - « n(rn) + 'Ep.f 1 ( dt, dii. (18-40) 

. 'O Jv 

We also have the relation 

pkm) = pfi- Vi W kT . (18-41) 

On combining Eqs. (18-36), (18-37), (18-40), and (18-41), we obtain 

i Dij(ra) = u n( r a) + ’EpJ 1 f Ui.[e- w ' ii{(i)lkT - e- Vi ‘ lti)lkT ]dt.dii. 

(18-42) 

This is the generalization of Eq. (i7-20) to solutions. 

We can deduce a more general equation than (18-42), for tpy({,), in 
exactly the same way as we obtained Eq. (17-24) from Eq. (17-20), 
but we omit this derivation. The final result can easily be written down 
on inspection of Eqs. (17-20), (17-24), and (18-42). 

Also, we can extend the above argument without difficulty to succes¬ 
sively larger groups of fixed molecules. For example, if we couple a new 
molecule of species i near fixed j and k molecules, we find (Problem 18-5) 

v>% = wn + ua + ^ P./J f r Ui,[e- w ^ ti)lkT - e dr, d{,-. 

* ° V (18-43) 

Finally, we note that Eq. (17-10) for the pressure of a monatomic fluid 
can easily be extended to fluid mixtures (Problem 18-6): 
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i r 

yCyt = y / P* QkT y ■ dfi 

where Qa — e~ m ‘> lkT is a function of all the p’s and T. 


(18-44) 


18-3 Electrolyte solutions. In this section we apply the equations of 
Section 18-2 to the idealized electrolyte model described in the first 
paragraph of Section 18-1. Here again the solvent, if any, is considered 
a nonmolecular continuum. We first write down general equations and 
then observe that the Debye-Hiickel limiting relations furnish a solution 
of these general equations in the high dilution limit. This provides a 
more firm statistical-mechanical foundation for the Debye-Hiickel limi t- 
ing law than the Debye-Huckel argument (Section 18-1) itself. 

We begin with Eqs. (18-37). It is now convenient for us to write Wi 
in the form 

W’iirtj) — kT In y J i(r { j) + *,*^,(r fi ) (r,y > a), (18-45) 


where ^y(ry) is the mean electrostatic potential a distance r,-, from a 
fixed j ion [with ^y(oo) = 0], and where 7$ is a “local activity coefficient,” 
defined by Eq. (18-45) in terms of W{ and tf-y. The point of separating 
W{ in this way is that now Eqs. (18-37) give 

Pkm) = p < e-*^ r « > J^^] (r„ > a), (18-46) 


which (a) is analogous to an electrochemical phase equilibrium with 
activity coefficients included [see the discussion following Eq. (18-7)], and 
(b) furnishes the explicit correction factor by which the approximation 
(18-7) has to be multiplied to make it exact. Of course, the introduction 
of 7{. does not provide anything new and can be avoided, if desired, in 
view of Eq. (18-41): 






<Tii > «)• 


(18-47) 


Qualitatively, we would expect 7j(r,y) y* for finite r,y since (a) the 
mean local ionic strength at r,y is different than in the bulk solution 
(r,y =8 oo), and (b) the atmosphere around an t ion at r,y is not sphericaUy 
symmetrical as it is at roo. The exact Poisson-Boltzmann equation is 


vV,(r) = - (r > a) (18-48) 

« 


♦ 



(r > a). (18-49) 
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An explicit expression for the mean electrostatic potential at r, 

a distance r,/ from a fixed j ion, may be written as follows. The contribu¬ 
tion to \f/j owing to an «ion at r, is r,e/er<,. The contribution of all s ions 
in dr, is then 


tr - P,e 
»(» 


Hence the total potential at r,-, arising from the fixed j ion and its ion 
atmosphere, is 


iifXii) 


_ 

er« 






dr,. 


(18-50) 


This has to be the solution of the exact Poisson-Boltzmann equation, 
(18-48), but it is expressed in terms of the (in general) unknown func¬ 
tions to,,-. 

The electrostatic potential at r,/ = 0, when the central j ion is charged 
to the degree £/ and owing to all ions other than j itself, is, from 
Eq. (18-50), 

*y tm (0; {/) = £ P'f v , 6—^ ),lT dr„ (18-51) 

where V' means that the region r,/ < a is excluded from the integration 
since no ions (other than j) are present there (to,/ = +oo for r,/ < o). 
If, in Eq. (18-36), we introduce Eq. (18-51) and use (18-1) for «<,, 
we have 

kT In =\i+ f 1 tm (0; {,) d{<, (18-52) 

Jo 

where X< is a small term arising from the region r„ < a [see, for example, 
Eq. (18-22)]. This is a generalized version of Eq. (18-20), used in the 
Debye-Huckel theory. 

Equation (18-42) for to,/ is converted into an equation applicable to 
electrolytes by substitution of Eq. (18-1) for «,/ and u*. An explicit 
equation for In 7j/7< in Eq. (18-47) can then be written down, using 
Eq. (18-42) for to,/ and Eq. (18-50) for ^/ (Problem 18-7). 

We now verify that the Debye-Httckel limiting expressions for a very 
dilute solution are consequences of the equations of the present section 
and Section 18-2. We seek a (limiting) solution of the series of equations 
of which Eqs. (18-42) and (18-43) are the first two members, using 
Eq. (18-1) for «y. We try a solution of the form 


«>*/({<) = w f ,({<) + to,/, 
tf*/i(!>) = to <,({,) + to,/ + to,*, 


(18-53) 

(18-54) 
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etc. (i.e., superposition), where 

z-zm a e~ ar ' is 

Vtifrti) = z -^ - (18-55) 

er ij 

Here a is a parameter whose value is to be determined. For arbitrary 
we replace z, by z,{,-. Further, we expand all exponentials, e~ w H ,kT , and 
retain only linear terms, pending establishment of the conditions that 
render this step legitimate (i.e., w^/kT <5C 1). We ignore small terms 
associated with the region r < a, because these prove to be negligible 
(as in the Debye-Huckel limiting law) under conditions such that the 
solution we find is valid. Substitution of Eqs. (18-53), (18-54), and 
e-waller _ i _ ( Wij /kT) in Eq. (18-43) and in all higher members of 
the series, reduces these equations to Eq. (18-42), which, therefore, is 
the only one we need consider. 

Equation (18-42) reads (after integrating over £,) 

Next, we substitute Eq. (18-55) in Eq. (18-56), factor out z,z ; * 2 /e, and 
write Tij = r, n. = u and rj, = v: 

—<*r i 2 _ f „—av 

-— = - - -Pr Y, / ~— dt - (18-57) 

r r ekT*-f Jv uv ' 

In Fig. 18-3, rotation of the element of area dx dy about the z-axis sweeps 
out dr„ so that dr, = 2-iry dx dy. We transform coordinates x and y to 
u and v, where 

w 2 = x 2 + y 2 , v 2 = y 2 + (r - a:) 2 . 

The Jacobian of this transformation gives 



nj = r 3 0 < V < 00 , 

Figure 18-3 \r — t>| < u < r + v. 
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Equation (18-57) becomes, then, 


where 



r+» 

//-/ o ,-*/*, = 

If—t>l 


(18-58) 


Thus Eq. (18-58) is an identity [that is, Eqs. (18-53) through (18-55) 
are a solution of Eqs. (18-42), (18-43), etc.] if 


2 4 ire 2 2 
a ~ ~(kf P,z ‘ ’ 


that is, if a = k (Eq. 18-11). Equation (18-55) turns out to be, then, 
just the Debye-Huckcl limiting potential of mean force, (18-28). 

From Eq. (18-55) with a = k, we find, as in the Debye-Huckel theory, 
that the ion atmosphere has an extent of order r = 1/k and hence that 
the linearization of e~ v>i > liT is legitimate if the solution is dilute enough 
to satisfy (18-26). 

Let us confirm that the remaining Debye-Huckcl results are also ob¬ 
tained. (a) When we put Eq. (18-55) (with a = k) in Eq. (18-50) for 
ypj, we find, using Eq. (18-58), that 

Ur) = e~". 


Therefore, w,y = z,«^y. However, on comparing Eqs. (18-42) and (18-50), 
we see that this is a limiting or asymptotic relationship. In general, 
Wij 7* z&frj (see also Problem 18-7). (b) Equation (18-51) gives 

*r( 0 ; fy) = - ^ > 

and hence, from Eq. (18-52), 

mn7,-= -^l^- 


(c) Finally, if we put Eqs. (18-1) and (18-55) (with a = k) in Eq. (18-44) 
for p/kT, we obtain (Problem 18-8) Eq. (18-31) again. Note that the 
integral in Eq. (18-44) converges because the potential of mean force, 
(18-55), is a short-range potential if k > 0 (the ions i and j are both 
screened by ion atmospheres). But if we try to calculate second virial 
coefficients from Eq. (18-44) by taking the limit k —> 0 in jry [see the 
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paragraph following Eq. (17-10)], t»y in (18-55) approaches uy and 
the integral diverges (the ions i and j are no longer screened by ion atmos¬ 
pheres). In summary: Eq. (18-44) can be used in electrolyte theory, but 
we cannot expand py in this equation as a power series in the concentra¬ 
tions for this would lead to a virial expansion that diverges. 

Important, but advanced, recent papers on electrolyte theory have 
been published by Mayer,* Kirkwood and Poirier,f and Meeron.J 


Problems 

18-1. Show that the most probable value of r in the Debye-Hfickel ion 
atmosphere, (18-24), is 1/k. Find also the mean value of r, ?, in terms of k. 
(Page 329.) 

18-2. Verify Eq. (18-25) for 1/x in water at 25°C. (Page 329.) 

18-3. Show that the condition (18-26) amounts to 1 /*» 7.1 A for water 
at 25°C. (Page 329.) 

18-4. Verify the details in the derivation of the thermodynamic functions 
(18-29) through (18-33). (Page 330.) 

18-5. Derive Eq. (18-43) for w) k . (Page 333.) 

18-6. Extend the derivation of Eq. (17-10) to the pressure of a monatomic 
fluid mixture. (Page 333.) 

18-7. Show that 

kT In f ^1 = small terras + p, f f - **** 

L if J 7 lo «•<. 

X - e - w ‘J lkT - dr, d£i. (18-59) 

Verify that Y(/Y,. = 1 in the Debyc-HQckel limiting law concentration region. 
(Page 335.) 

18-8. Deduce the Debye-Httckcl limiting expression for p/kT, (18-31), from 
Eqs. (18-44) and (18-55). (Page 337.) 

18-9. Consider a charge zj« at the center of a sphere of radius a and with 
dielectric constant «i„. Outside the sphere is a dilute electrolyte of point ions 
in a medium of dielectric constant e. Use the linear Poisson-Boltzmann equa¬ 
tion outside the sphere. Show that the work done in charging up the sphere 
(keeping the electrolyte fully charged) against the ion atmosphere and dielec¬ 
trics, is 

_i\ 

2 ta \1 + k a « ta / 

* J. E. Mayer, J. Chem. Phyt. 18, 1426 (1950). 
t J. G. Kirkwood and J. C. Poirier, /. Phyt. Chem. 58, 591 (1954). 
t E. Meeron, J. Chem. Phy». 28, 630 (1958). 
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18-10. This is the same os Problem 18-9 except that the charge z,e is smeared 
uniformly over the surface of the sphere. Show that the work here is 


1 

2 


2 2 

Zfi 


«o(l + Ka) 


18-11. This is the same as Problem 18-9 except that the charge Zj« is smeared 
uniformly throughout the volume of the sphere. Show that the work here is 


1 

2 


2 2 




ta 


1 


<1 + Ka 



18-12. Show from Eq. (18-24) that the integral of the charge density »j(r) 
in the Debye-Hftckel theory, over all space, gives — rye, as expected. 

18-13. Show that in the Debye-Httckel ion atmosphere, (18-24), the fraction 
of net charge between a and 2a, in the limit as xa —* 0, is 3(ica) 2 /2. 

18-14. Use Eq. (18-23) to calculate Y ++ in a 2-2 aqueous electrolyte at 
25 # C and at a concentration 10 moles-1 -1 . 

18-15. Find the expression for 1/x, equivalent to (18-25), which is applicable 
for (a) a 2-2 electrolyte, (b) a 3-3 electrolyte, and (c) a 1-1 electrolyte in 
solvents with (t)< = 10 and (ii)e = 1 (solvent = vacuum; i.e., a plasma). 
In each case find the value of c giving 1/x =* 30 A. 

18-16. Calculate and compare the magnitude of terms in Eqs. (18-22) and 
(18-31) for a 1-1 aqueous electrolyte at 25®C and a concentration 10~ 8 moles-1 -1 . 
Take a — 3 A. 

18-17. Verify the evaluation of the integral in Eq. (18-58). 

18-18. Show that Eq. (18-36) leads to 


In 72 — Bupi + 2 B 02 P 2 + • • • 


for a dilute binary gas mixture, a result used in Eq. (18-21) (obtained from 
Chapter 15). 


Supplementary Reading 
Fowler and Guggenheim, Chapter 9. 

Habned, H. S., and Owen, B. B., Physical Chemistry of Electrolytic Solutions, 
3rd ed., New York: Reinhold, 1958. 

Kirkwood, J. G., J. Chem. Phys. 3, 300 (1935); Chem. Revs. 19, 275 (1936). 
Landau and Lifshitz, Sections 74 and 91. 
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CHAPTER 19 

DILUTE LIQUID SOLUTIONS 

The main thesis of the present chapter is that the expansion methods 
of imperfect gas theory (Chapter 15) can be applied to a liquid solution 
which is dilute with respect to at least one (“solute”) component, pro¬ 
vided that the interaction between solute molecules is a short-range one 
(energy « r -n , n > 3, for large r). The technique is to reduce the many- 
body problem to, successively, (a) the interaction of one solute molecule 
with pure solvent (this determines the Henry’s law constant, for example), 
(b) the interaction between two solute molecules immersed in pure solvent, 
etc. This procedure is not possible, as we saw in Chapter 18, with the 
long-range forces in an electrolyte solution. 

The same expansion methods can be applied to gaseous and solid 
solutions, but these cases are less important than liquid solutions. 

The expansions encountered in a particular solution theory depend 
on the choice of independent variables and ensemble. The expansions of 
the different theories may be interconverted by suitable thermodynamic 
manipulations, so any formally exact theory is in principle equivalent to 
any other one. But, generally speaking, if one is interested in the expan¬ 
sion of a given dependent variable as a function of a certain set of inde¬ 
pendent variables, the simplest result will be obtained if the ensemble is 
suitably chosen to yield the expansion directly—instead of indirectly by 
thermodynamic operations on series obtained from another ensemble. 
Hence there is some point in discussing more than one dilute solution 
theory. 

We shall emphasize in this chapter, rather arbitrarily, the McMillan- 
Mayer theory, which is particularly suitable for an investigation of 
osmotic systems and the osmotic pressure. Section 19-1 contains a gen¬ 
eral discussion of this theory, while Section 19-2 is devoted to particu¬ 
lar applications. For contrast, we consider in Section 19-3 a solution 
theory which is especially appropriate for solutions at fixed pressure 
and temperature with molality as composition variable. This is a very 
common choice of independent variables in practical solution thermo¬ 
dynamics. 

An important alternative approach to solution theory, which we shall 
not include, is that due to Kirkwood and Buff.* 


* J. G. Kibkwood and F. P. Buff, J. Chem. Phys. 19, 774 (1951). 
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“Outside” 

“Inside” 

Component 1 

j Components 1,2 

l»,T 

; Ml, M2, T 

P,V 

l v + n, v 

#>2 = 0 

1 Pi 

J_ 


Semipermeable 

membrane 


Fig. 19-1. Osmotic equilibrium system. The membrane passes component 1 
(solvent) but not component 2 (solute). 

19-1 McMillan-Mayer solution theory. We begin by considering a 
two-component solution which may contain monatomic or polyatomic 
molecules, and which may obey classical or quantum statistics. Com¬ 
ponent 1 is the solvent and component 2 the solute, the dilute component. 
We assume short-range forces between solute molecules, as already ex¬ 
plained above. Our object is to choose partition functions in such a way 
that we can develop a treatment here for “solute in solvent” which exactly 
parallels the treatment of “gas in vacuum” in Section 15-1. For this 
reason, the reader should review Section 15-1. The derivation we give* 
is chosen for its relative simplicity, rather than for its generality-f 

The two-component solution referred to above (the “inside” solution 
of Fig. 19-1) is in osmotic equilibrium with pure solvent (“outside” in 
Fig. 19-1). That is, a semi permeable membrane, permeable to solvent 
only, separates solvent from solution. The outside system has specified 
values of the chemical potential mi and temperature T. The values 
of mi and T fix the pressure p. The “inside” solution is also at tem¬ 
perature T and contains solvent at mi and solute at M 2 - The values of 
T, mi, and M 2 then determine, for example, the inside pressure p + n and 
solute number density P 2 - H is the osmotic pressure—the extra pressure 
needed on the inside (in the presence of solute which tends to lower mi) 
to give mi the same value inside as outside. 

The grand partition function for the “inside” solution is 

e <P+n>v,kT = £ Q Nl , N2 (V, T)xf‘Xf 2 
Wi,iVa>0 

= £ *JV,(M 1 , v, (19-1) 

N a >0 


* T. L. Hill, J. Chem. Phys. 30, 93 (1959). 

t More general and complicated derivations will be found in S.M., Chapter 6, 
and in McMillan and Mayer (Supplementary Reading list). 
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where 

'I'ATj = X) (19-2) 

N |>0 

Here ’J r A r a (# i i> V, T) is a “semigrand” partition function for a system at 
V and T, open with respect to 1 but not with respect to 2 (Problem 19-1). 
We note that in the special case N 2 = 0, 

*o = £ Qvtf 1 = e pvlkT , (19-3) 

N ,>0 

which is the grand partition function for the outside system. We shall be 
interested in small values of N 2 , that is, ¥i, 't f 2 , ^ 3 , etc., for solvent systems 
(pH, V, T) with only one, two, three, etc., solute molecules immersed in 
the solvent. We now have to digress to introduce activities, the Henry’s 
law constant, etc. 

If, as in Section 15-3, we define activities z\ and z 2 by Eqs. (15-41), 
then zi —» pi and z 2 -* p 2 in the limit p 1( p 2 —* 0 (ideal gas mixture). 
In this limit 

Pi = Pi (T) + kT In p u 
Pa = V&(.T) + kT In p 2 , 

where 

^ T >- l ‘ Tk {mr)' AT) = kT ' n {mr)' <««> 

Examples will be found in Eqs. (4-26), (8-40), and Problem 9-3. 

It is more convenient for the present problem, however, to define a 
solute activity 9 2 , proportional to z 2 , but with the property that 3 2 —* p 2 
when p 2 —» 0 in the inside solution (i.e., with pi and T fixed). In this limit 
we have pure solvent at pi and T, and not an ideal gas mixture, so z 2 does 
not approach p 2 but approaches instead, say, 7j{p 2 , which defines 7§(pi, T). 
This is a limiting value of the activity coefficient 7 2 defined by z 2 = 7 2 p 2 
(Eq. 15-47). Hence the connection between 3 2 and z 2 is z 2 = 7§(pi, T)3 2 . 
We shall encounter explicit expressions for 7® below. 

The inside solution in the limit p 2 —»0 is often referred to as the 
“infinitely dilute solution.” When p 2 = 0, the inside solution is the same 
as the solvent (outside). 

The fugacity of the solute in the solution (at any composition) is defined 
as usual by / 2 = z 2 kT. (That is, if the solute here is in equilibrium with 
an ideal gas phase in which the partial pressure of component 2 is p 2 , then 
/ 2 in solution is equal to p 2 in the ideal gas, since z 2 is the same in the two 
phases at equilibrium.) Now we define the Henry’s law constant k 2 (pi, T) 
for the solute in the solvent at pi, T by the statement that / 2 —* k 2 x 2 as 
x 2 — * 0, where x 2 is the mole fraction of the solute. In this limit, 
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x 2 —* PzViirn, T)> where vi is the volume per molecule of pure solvent 
at mi, T. Therefore, in the limit as p 2 —♦ 0, we have 


or 


«2 = 7®p 2 


k2pjVl 

kT ’ 


h(Pi, T) 


Ti0»i, T)kT 
T) 


(19-5) 


which gives k 2 in terms of 7®. 

Returning to the activity i 2 , we shall verify below that the definition 


&2 


X 2 *i 

* 0 V 


(19-6) 


gives, as required, i 2 -* Pi as Pa —*► 0 in the solution. Accepting this 
definition tentatively, we find that 


or 



= 


7 2 X 2 ¥i 

* 0 V ’ 


,0 _ Qoi^o _ 
*1 ” 


Ve~^ lhT ^o 

kT*t 


(19-7) 


Since 7§ is a function of mi and T only and Q 0 i is proportional to V, 
¥i/¥ 0 must also be proportional to V. We see that y 2 depends on the 
properties of one solute molecule in a vacuum (i.e., on Q 0 i) and on the 
nature of the interaction between one solute molecule and pure solvent 
(i.e., on ¥i/¥o). We shall examine a special case of Eq. (19-7) in Problem 
19-2. Equation (19-7) gives for ft 2 , 


. _ Qoi*okT 

2 *i»i 

From the relations 


(19-8) 


QoiX 2 

V 


Z 2 p 2 


(component 2 in vacuum), 


QoiXa 

y%v 


d 2 


P 2 


(component 2 in solvent), 


we see that Q 0 iA 2 plays the role of an "effective* partition function for 
one solute molecule in a volume V filled by solvent at j»i and T (analogous 
to Qoi for one molecule of component 2 in a volume V otherwise unoc¬ 
cupied). The factor 1/Y§ thus takes care of the "contact” of the solute 
molecule with solvent. This gives some indication of the physical sig¬ 
nificance of y% (see also Problem 19-2). 
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Another interpretation of is the following: 


t'2 
Z 2 


p 2 (solute in infinitely dilute solution) _ Qoi/y 2 V 
P 2 (solute in infinitely dilute gas) — Qoi/V 


1 

*80*1, T) 


= mu T). 


Comparison with Eq. (10-6) shows that l/7§ = K(pi, T) is the “equi¬ 
librium constant” for the process 


solute in gas <=* solute in solution. 


The numerical value of 7§ depends on how a single solute molecule “likes” 
(including both energy and entropy effects) being in a vacuum relative 
to pure solvent. 

We now return to the expansion (10-1). To simplify the notation we 
use N for N 2 (number of solute molecules), replace X 2 by b 2 (Eq. 19-6) 
and ^r N by Z%, where Z%(mu F, T) is defined by 




(19-9) 


Note that Z* = V. The definition (19-9) is constructed so that Z* N plays 
the same formal role here for AT solute molecules immersed in the solvent 
as Zn does in Section 15-1 for N molecules in a vacuum (one-component 
gas). This will become more apparent as we proceed. Then Eq. (19-1) 
becomes 

e u , + n >^=* 0 + * 0 

N >I " l 

Using Eq. (19-3), we have 

e nvlkT = 1 + S T) *1 (19-10) 


This equation is seen to be formally identical with Eq. (15-6) for a one- 
component gas: II replacesj>, Z% replaces Zn, and b 2 replaces z. (We still 
have to check later that i 2 —■* p 2 as p 2 —■> 0 in the solution, just as 
z —> p as p —* 0 in the gas.) The solute is thus treated as a quasi-one- 
component system, with the solvent playing an implicit background role 
through its influence on Z%. We shall be able to put this in more physical 
terms below for a special case. Incidentally, this point of view is obviously 
the same as that adopted for the solvent, without formal justification, in 
Chapter 18 (on electrolyte theory). 
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Equation (15-6) is a special case of Eq. (19-10): if the “solvent” is a 
vacuum, then the outside pressure p in Fig. 19-1 is zero and II is the total 
pressure of the gas (solute) on the inside. 

We can now manipulate Eq. (19-10) just as we did Eq. (15-6). On 
taking the logarithm of Eq. (19-10) and expanding [see also the discussion 
of Eqs. (15-14) through (15-16)], we find 


n_ 

kT 


= £ bj(p i, T)ii 
j> i 


(19-11) 


where the b* are related to the Z *Jr by Eqs. (15-8). From the thermo¬ 
dynamic equation [see (1-65)] 




~ a(p + n) v l 

. 1 , 


T,V 


we have (p is determined by pi and T only and hence is constant) 


P2 = 


V 


i 2 


k d&2 Jn.T 


£*»<ft, T)ii 

j>i 


(19-12) 


just as in Eq. (15-10). We finally confirm here the property already used 
above that » 2 —»P 2 as p 2 —» 0, since b* = 1. Then, as in Eq. (15-12), 
we have the osmotic pressure virial expansion 


— p 2 4- £ #«(pj, T)p", (19-13) 

n> 2 

where the B* are related to the b* by Eqs. (15-13). In the limit p 2 —* 0, 
II /kT -* p 2 . 

From the general thermodynamic equation, for a binary solution, 


N i dp i + iV 2 dp 2 = -S dF + V dp, 


we have in our special case and notation, 

7?2 dp 2 = 7 d(p + II) = Vdn (jpi, T constant). 

Therefore 

/ dp 2 /kT \ _ J_ /<m/kT\ 

\ &P2 /h.t p 2 \ &P2 Jm.T 


(19-14) 


We substitute Eq. (19-13) here for II /kT, carry out the differentiation, 
multiply by dp 2 , and finally integrate. The result is 


^ = constant + In p 2 + 


eC-F) 


Bt+ip2, (19-15) 
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which is the analog of Eq. (15-19). To evaluate the integration constant, 
we recall that in the limit p 2 —► 0, 

Q01^ tlkT _ ^,0- -yO 

Z 2 — - y - — 7 2 92 —* ~2/>2- 

From this relation we find that 

, t , Vy% ,7*0 M 2 I, r, 

constant = In = In — — = r~ + In k 2 V\. 

Voi *i hi 

Therefore Eq. (19-15) can be written 

0 = In ^ + In p 2 + In 7a, (19-16) 

where 

In*- -£rtG*i ,T)p\, (19-17) 

*>1 

ft = - Bt+ (19-18) 

The activity coefficient 7 2 (the bar does not mean an average value) is 
obviously defined by the equation i 2 = P 2?2 and has the property, 
desirable for a practical activity coefficient, that 7 2 —► 1 as p 2 —> 0 (mi 
and T held constant). The relation between 7 2 and 7 2 is 

?2 = ^2P2 = 7j}j — 7 2 72P2> 
or 

72 = ^- (19-19) 

The treatment above parallels completely that in Section 15-1 for a 
one-component imperfect gas. Here, when p 2 0, interaction between 
one molecule and pure solvent is responsible for the term In (F*o/*i) 
in Eq. (19-16) for M 2 but does not influence the “equation of state,” 
n = p 2 kT; at higher concentrations, B*, which depends on the properties 
of a pair of solute molecules in pure solvent, contributes to p 2 and U/kT; 
etc. The quantities Z%, b*, B*, /Sj have a significance for solute molecules 
in solvent which is identical with that of Zy, bj, B n , and 0* for gas mole¬ 
cules in vacuum. 

The above equations are necessarily quite formal because of their 
generality. To give a better physical understanding of their significance, 
we now turn to the simplest special case, a classical binary solution of 
monatomic molecules, and introduce the potential of mean force. 
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Classical binary solution of monatomic molecules. We first have to extend 
Eq. (17-35) for g w = e~ w n),kT to a binary system. We let {«i} represent 
the spatial positions r t ,, r Ml of a set of n x molecules of component 1, 
and let d{ni} represent dri... dr ni . Then, as in Eq. (17-33), the proba¬ 
bility that, in a closed binary system with numbers of molecules N 1 and 
N 2 , any n x molecules of component 1 will be found in d{»i) and any w 2 
of component 2 in d[n 2 ] is 


Nil N 2 \ 

Ufi - m)! (N 2 - n 2 )\ 

e-vs^ 7 d { Nl _ ni } d{N 2 - « 2 } 

ZniN 2 


X 


d{»i} d{n 2 } 


(19-20) 


The probability that an open system characterized by z u z 2 , V, T contains 
the numbers of molecules N j, N 2 is 


Zi'Z^ZniNi 

Ni\N 2 \3 


(19-21) 


If we take the product of (19-20) and (19-21) and sum over N x and N 2 , 
we have the probability that any n x molecules of 1 are in d{ni} and any 
» 2 of 2 are in d{n 2 ), in an open system. We denote this probability by 
Pi'P 2 * 0 w d{ni} d{n 2 ), which defines the distribution function g in) (n 
refers to the set of numbers »j, n 2 —a general notational system we have 
been using throughout most of this book), a function of z it z 2 , and T as 
well as of {«!) and {n 2 |. Thus we have 


pVpV9 w = | £ 






S N &, (#1 ~ ~ B »> ! 


X j y e~ v HxNt liT d{NI - Wj} d{N 2 - n 2 }, 


(19-22) 


which is the generalization of Eq. (17-35). Since N x and N 2 are merely 
dummy summation indices, we can change notation by putting N x for 
N x — «i, N x + nj for N x , etc. This gives 

<A*, to, T) = f v exp (- Un+n/kT) d{N}. (19-23) 

where z refers to the activity set z x , z 2 . 

Now we define to (n> by 

g w = exp [~w w /kT], (19-24) 


substitute this expression in Eq. (19-23), and integrate over {n}. The 
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result is 

J exp [—w w /kT] d{n} 


Zn>0 (z^/N.lN^Zn+n nQ _ 2 -v 
Zh>o (z?'2%*/N 1 \N 2 \)Z« 


As in Section 17-4, w {a) is the potential of the mean force acting between 
the set of molecules n located at fnj in the solution characterized by z, 
V, T. We are particularly interested in the special case of a set of N 
solute molecules only, n = 0, N, in the limit as z —* zj, 0 (i.e., AT solute 
molecules immersed in pure solvent at zj, T —the outside system in Fig. 
19-1). In this limit, only terms with N 2 = 0 contribute in the sums of 
Eq. (19-25), so we find 


[ _«>/» yf 2>,>o tf'/NtQZw 

V d ' K> - ,!)Z„ 


(19-26) 


We shall make use of this result presently. 

If we substitute Eq. (19-7) in Eq. (19—9), we obtain 

_ ATttyF* (*oY _ Ntosyf A| y 
Zn ¥ 0 *o 


Also, from Eq. (19-2), we have 


*A- = 


1 

A 2 NNl £>o N >' 


(19-27) 


(19-28) 


Using Eq. (19-28), we eliminate ¥n and ¥ 0 fro" 1 Eq. (19-27) and observe 
that Z% is the same as the right side of Eq. (19-26). Therefore, 

Z% = f e -'° m ' kT d{N}. (19-29) 

Jv 

Equation (19-29) should be compared with Zn for a classical one- 
component monatomic gas (Section 15-2): 

Z N = f r e' Ulf,kT d{N}. (19-30) 

We see that, for classical monatomic systems f at least, the analogy be¬ 
tween the formal significance of b*, etc., for a dilute solution and 
Zn, bj, etc., for a dilute gas, already emphasized above, extends even 

further: Un in Eq. (19-30) is the potential of the force between N gas 


t Actually the analogy between w W) and Un is much more general. It extends 
to systems of polyatomic molecules of any complexity in which vibration is 
separable and in which rotation (internal and external) and translation are 
classical. See S. M., Section 40. 
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molecules in a vacuum, while w m in Eq. (19-29) is the potential of the 
(mean) force between N solute molecules in the pure solvent. Thus the 
physical significance, in terms of force or work, of w (N) and Un in the 
respective cases is identical. 

This analogy makes it possible to write down immediately, by inspec¬ 
tion, a number of equations for dilute solutions. For example, from 
Eq. (15-24) for a one-component gas, we have 

Bfcu T) = -if* [e~“’ <J> < r ' Zl ' r>/A:r - l)4irr 2 dr. (19-31) 

Similarly, from Eq. (15-30), we have an equation for B%{z u T). It should 
be emphasized here, though, that the assumption 

U 3 = W 12 + Mu -)- U 23 , (19-32) 

used in Eq. (15-30), is in a different class than 

u> <3) — Wn -|- W 13 + u>23) (19—33) 

needed for B\. Equation (19-32) is a quantum-mechanical approximation, 
and a good one in general, but Eq. (19-33) is statistical mechanical (the 
w’s are functions of Z\ and T) in origin and not necessarily a very good 
approximation. Equation (19-33) is in fact the superposition approxima¬ 
tion of Chapters 17 and 18. Another example is Eq. (17-10): 

FT = P2 ~ A L rM/(r> T> *»’ 0 )e -w<r,r ' z " 22)/t7 47rr 2 dr, (19-34) 

where w — u/ 2> and here the solution is not necessarily dilute. The 
superposition approximation (pairwise additivity) has been made use of 
in u>(zi, 0) but not in w(z u z 2 ). The extension of Eq. (19-34) to a multi- 
component solute is obvious from Eq. (18-44). It is this extension which 
is in reality involved in Problem 18-8 for the Debye-Huckel p (actually, 
n), if the dielectric is not a vacuum. 

In view of the fact that the solvent does not play an explicit role in 
Eq. (19-29) and in view of the interpretation of iu w> as the potential of 
mean force between solute molecules immersed in the outside solution 
(solvent) of Fig. 19-1, it is physically obvious (and can easily be con- 
firmedf) that whether the solvent is made up of one component or many 
components, formal relations such as Eq. (19-29) and those between 
Z* N , b*, B*, etc., are unchanged. A multicomponent solvent influences 
the system indirectly or implicitly through its influence on w W) , which is 
now a function of T and one z for each solvent species (we denote this 


t See S. M., Section 40. 
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“Outside” “Inside" 



membrane 

Fig. 19-2. Osmotic equilibrium system. The membrane passes all solvent 
species (subscript r) but not the solute (no subscript). 


set of solvent z’s by z r ). For example, Eqs. (19-13) and (19-17) become 
4=P+£ B:(Xt, T)p n , (19-35) 

n>2 

In’? = £ Bt +1 (z T , T)p k , (19-36) 

ki. i ' K ' 

where p is the number density of the single solute (“nondiffusible”) species 
on the “inside” in Fig. 19-2 and 7 is its activity coefficient. All of the 
solvent species can pass through the semipermeable membrane in Fig. 
19-2. The second virial coefficient is 

B*(z t ,T)= ~if o {exp (—w(r, Zt ,T)/kT) — l}4irr 2 dr, (19-37) 

where w(r) is the reversible work necessary to bring two solute molecules 
together from r = oo to r in the solvent at z,, T (i.e., in the outside 
solution). 


19-2 Applications of the McMillan-Mayer theory. We consider here a 
few examples of the calculation of B\ for a solute in a solvent (which may 
be multicomponent). then determines the first departures from 
limiting behavior in Tl/kT and In 7 [Eqs. (19-35) and (19-36)], for example. 

Hard spheres. Suppose the nondiffusible species (i.e., the solute) in an 
osmotic equilibrium can be represented approximately by hard spheres 
of diameter a. Spherical protein molecules, under some conditions, might 
be an example, f That is, 


w(r) — +oo r < a 

= 0 r > a, 


(19-38) 


t The many “internal” degrees of freedom—vibration and rotation—of such 
a molecule can be ignored in calculating virial coefficients. 
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as in Eq. (15-25). Then we find, as in Eqs. (15-26) and (15-30), that 

B* — > B* = iB* 2 \ (19-39) 

If we let v m be the volume occupied by the molecule (sphere), B% — 4v m . 

Considerable work has been donef on the calculation of B\ for “hard” 
molecules which are not spheres. Of course, in this case a generalization 
of Eq. (19-37) has to be used (Problem 19-3) which involves integration 
over rotational orientations as well as over r. It is found that B\/v m is 
always greater than 4 for nonspherical particles and that, for highly 
anisometric particles (e.g., “needles” or “pancakes”), B%Jv m is approxi¬ 
mately equal to the ratio of the long to the short dimension. Solutions 
of such particles will therefore be “nonideal” at much lower molar con¬ 
centrations than spheres of the same volume. 

The calculation of B\ for polymer and polyelectrolyte molecules will 
be deferred to Chapter 21. 

Donnan membrane equilibrium. A Donnan equilibrium is an osmotic 
equilibrium in which the nondiffusible species (solute) is charged and all 
other species, including at least two which are charged and of opposite 
sign, can pass through the membrane. 

Necessarily, if there is a charged nondiffusible species, then for electrical 
neutrality at least one other charged species must be present on the 
“inside.” If there is only one other charged species (of sign opposite to 
the solute) on the inside, it too will be confined to the inside, even though 
it could under other conditions pass through the membrane, for otherwise 
the outside solution would not be electrically neutral. (Even extremely 
small departures from neutrality would be resisted by the creation of 
very large electrostatic potentials.) The uncharged solvent can, of course, 
be on both sides. In this case, then, we have an electrolyte solution on the 
inside and solvent (usually water) on the outside. This is not a Donnan 
equilibrium. The osmotic pressure II here is just that (called p) of Chap¬ 
ter 18. We cannot use a virial expansion of II because w(r) (=w <2) ) for 
a pair of ions immersed in the outside system (solvent) has the coulombic 
form, 1/r (Eq. 18-1), and hence the virial coefficients diverge. Incidentally, 
the reason why the coulombic potential energy (18-1) is denoted by u{r) 
in Chapter 18 and by w(r) here is that we are now treating the solvent as 
a molecular component of the system: Eq. (18-1) is an approximation for 
the work necessary to bring two ions together, calculated from a mean 
force, that is, a force averaged over all configurations of the solvent 
molecules. 


f See B. H. Zimm, J. Chem. Phys. 14, 164 (1946); L. Onsaoer, Ann. N. Y. 
Acad. Sci. 51, 627 (1949); Hirschfelder, Curtiss, and Bird, pp. 183-187. 
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To achieve a Donnan equilibrium, we must have, besides the charged 
solute on the inside, at least two other charged (opposite to each other) 
species on the inside. Then both of these latter species can also be on the 
outside, with electrical neutrality achieved on both sides. An example 
would be P + (protein, say), Na + , Cl - , and H 2 0 on the inside, and Na + , 
Cl - , and H 2 0 on the outside. The “solvent” here (i.e., the outside system) 
is itself an electrolyte solution. Equations (19-35) and (19-36) are ap¬ 
plicable. We can use a virial expansion approach because w M) ( z T , T) for 
a set of A charged solute molecules refers to the set immersed in the out¬ 
side (electrolyte) solution. That is: the solute molecules in the solvent 
are screened by ion atmospheres; w(r) has the form e~"/r (k refers to the 
outside solution), at least approximately; hence, the virial coefficients 
do not diverge. 

The traditional approach to this problem is the method of Donnan in 
which a membrane potential is introduced. One of the more satisfying 
aspects of the present statistical-mechanical method is that this opera¬ 
tionally questionable conceptf does not enter the discussion. 

As a first example of a Donnan equilibrium, let all ions (including the 
solute) be treated as point charges (a = 0), let the charge on a sol¬ 
ute ion bet ze, and let us use the Debye-IIiickel limiting w(r) (Eq. 18-28) 
for a pair of solute ions immersed in the outside solution: 


w(r) = > (19-40) 

where k depends on the outside ionic strength. We note that intensive 
properties of the system are completely determined by the concentration 
p of solute on the inside, by T, and by the dielectric constant and ionic 
strength on the outside. Examples of such intensive properties are II, 
In and pl/p* for a diffusible ion of type k (Problem 19-4), where pj is 
the inside concentration and p® is the outside concentration. 

To calculate B% we put Eq. (19-40) for w(r) in Eq. (19-37), after 
expanding the exponential in the latter equation up to the term in ( w/kT ) 2 . 
This suffices to give us B\ up to the linear term in a = « 2 K/ekT, which is 
as far as we are justified in going [see Eq. (18-26)]. Elementary integra¬ 
tions then give 

= 2L ~ 8L ’ (19-41) 


t See E. A. Guggenheim, Thermodynamics. Amsterdam: North-Holland, 
1957; pp. 374-381. 

t Note the difference between the symbols r (charge number) and z (activity 
set). The distinction will also be clear from the context. 
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where 

£ = £pM. 

* 

The first term on the right can be shown to be a consequence only of the 
fact that the Debye-Hfickel u>(r) is consistent with electrical neutrality 
in the outside solution (Problem 19-5). The second term is probablyf not 
the exact linear term in a, because higher terms in w(r) [i.e., Eq. (19-40) 
gives the leading term in an expansion] may also contribute linear terms 
in a to B%. 

As a second example of a Donnan equilibrium, let us calculate for 
the potential of mean force between solute molecules (in the solvent), 

w(r) = +oo r < a 

2 2—*(r— a) (19-42) 

= «r(l + Ka) r ~ 


where k again refers to the outside solution. This w(r) would be appropriate 
as an approximate potential in two cases. (1) All ions, including solute 
ions of charge ze, have a diameter a (Eq. 18-27). (2) The solute ions are 
relatively large spheres of diameter a, each with total charge ze smeared 
uniformly over the surface; the relatively small diffusible ions are treated 
as point charges (Problem 18-10). One finds in either case, 


B 


* _ 

2 — 


2wa 3 . z 2 z 4 ar 

3 + 2 E 8(1 + xa)2 Z ' 


(19-43) 


The first term on the right is the hard sphere term (Eq. 19-39), the second 
is the “neutrality” term, and the third is definitely not the exact linear 
term in a because Eq. (19-42) is not even an exact leading term for w(r). 
However, Eq. (19-43) should be a useful approximation for a not too large. 

Solute with binding equilibrium. A relatively complicated but instruc¬ 
tive and important (especially in physical biochemistry) application of 
the McMillan-Maycr theory is to an osmotic system in which the solute 
molecules can bind (adsorb) one of the diffusible (solvent) species. The 
most important example is the binding of hydrogen ions by solutes such 
as proteins, nucleic acids, polyelectrolytes, etc. But in other systems, 
other ions or molecules, in the solvent, may also be bound to solute mole¬ 
cules. Some of the questions that arise for this kind of system are the 
following: How is the binding equilibrium (e.g., the titration curve, if H + 
is being bound) influenced by solute concentration? How does the osmotic 


t Sec T. L. Hill, Faraday Soc. Disc. 21, 31 (1956); J. Phys. Cbem. 61, 548 
(1957). 



354 


DILUTE LIQUID SOLUTIONS 


[CHAP. 19 


pressure second virial coefficient depend on the concentration of the species 
being bound? How does the potential of mean force between a pair of 
solute molecules (at infinite dilution, p —» 0) depend on the concentration 
of the species being bound? 

The present problem is a generalization of Section 7-3, which should 
be reviewed. In Section 7-3 we were concerned with binding of one species 
on another in an ideal gas mixture. Here we have intersolute interactions 
and the presence of solvent to complicate matters. 

To keep the notation from getting too complex, we consider the follow¬ 
ing special case: (I) there is only one solute species; (2) only molecules 
of one solvent species can be bound on solute molecules; and (3) the 
potential of average force u> <K) between a set of solute molecules depends 
on the number of bound molecules on each of the solute molecules in the 
set and on the location of the center of mass of each of the solute molecules, 
but not on the manner of distribution of bound molecules among the 
binding sites of each solute molecule nor on the rotational orientation of 
the solute molecules. All of these restrictions can, however, be removed.* 

For convenience we refer to the solvent species that can be bound as A 
(adsorbate). Each solute molecule can bind up to m A molecules, s = 0, 
1 ,,m. There are thus a total of m + 1 solute “subspecies,”depending 
on the value of s. The partition function of a single solute molecule with 
s A molecules bound to it (an s-solute), in a box of volume V at T, is 
denoted by q(s). We let X, be the absolute activity of s-solute molecules 
and X be the absolute activity of A molecules. The activity z, for an 
s-solute is defined as usual (Eq. 15-41) by 



Then z, —* p, if all species become infinitely dilute (ideal gas mixture). 
The activities of all solvent species other than A are represented by the 
set z T . The outside solution (solvent) in the osmotic equilibrium is there¬ 
fore characterized by z r , X, and T; and the inside solution is characterized 
by z r , X, T, and the total (all subspecies) solute concentration p(=£«p,). 

The general method we use is: (a) write an expression for UV/kT that 
is a generalization of Eq. (19-10), regarding the solute as multicomponent 
(»i + 1 “different” solutes); (b) introduce in this expression the fact that 
actually the solute subspecies are not independent of each other but are 
interrelated by binding equilibria; and finally (c) develop series expan¬ 
sions, etc., treating the solute as a single (composite) component. 


* J. G. Kirkwood and J. B. Shumaker, Proc. Nat. Acad. Sci. 38, 855, 863 
(1952); T. L. Hill, J. Chem. Phys. 23, 623, 2270 (1955). 
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The rather obvious generalization, which we shall not prove,* of 
Eqs. (19-10) and (19-29) to a multicomponent solute in an osmotic 
equilibrium, is 


nvikT _ 
6 — 



N,\ \)v 


exp [-w™/kT\ <f{N}, 


(19-45) 


where the sum is over all solute sets N = N 0 , Ni,, N m , and where 
7° and w <W) are functions of z T , X, and T. As explained in Section 19-1, 
7® depends on the interaction of one s-solute molecule with solvent (outside 
solution), and io (If) depends on the interaction between a set of N solute 
molecules immersed in the solvent (outside solution). 

Let Ni be the mean number of t-solute molecules in the inside solution. 
Then, as in the equation preceding (19-12), 

\ 02 i / z,* i ,t T ,KT.V \ 02 i /z,* i ,z r XT,V 

- [n T^]/ v “P <i(K}. (19-46) 

To conform with the notation in Section 7-3, we let 37 be the total number 
of solute molecules of all subspecies and let N be the total number of A 
molecules bound to these solute molecules. Then 


37 = £ff,-, 

»—o 

N = £ iNi. 

i— 1 


(19-47) 


Hence we get expressions for 37 and N from Eq. (19-46) by replacing 
N it behind the summation sign, by Hi Ni and £,■ iNi, respectively. 

At this point we recognize explicitly the fact that the solute subspecies 
are interconnected by binding equilibria: 


s-solute <=* 0-solute + sA, 

= Mo + 

X, = X 0 X‘, z. = (19-48) 

This means that only one of Xo, X t ,..., X m is independent, and we choose 
X 0 (0-solute; solute with no A bound). We substitute Eq. (19-48) for z. 


* See: Eqs. (15-43) and (15-51); S. M., Section 40; and Problem 19-15. 
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in Eq. (19-45) and obtain 


nr/ *r 


= £ 


N>0 L 


n 




w (V) /kT] d{N}, (19-49) 


where 



(19-50) 


The physical significance of H, (a function of T, X, z T ) for an s-solute 
molecule is evident from the paragraph following Eq. (19-8): q(s )/is 
the effective partition function of an s-solute molecule immersed in the 
outside solution; q(s) itself is discussed following Eq. (7-46). 

Note that if we perform the operation \ 0 d(nV/kT)/d\ 0 on Eq. (19-49), 
we get just TV, [see Eqs. (19-46) and (19-47)]. Therefore 


37 = X 0 


'dUV/kf \ 

k dX 0 /^.X.T.K 


(19-51) 


This shows that if we regard all solute molecules (of whatever subspecies) 
as a single component, then mo should be considered the chemical poten¬ 
tial of the solute (thus p 0 has the same significance as p! in Section 7-3). 
Next, we observe that the purely formal operation \d(nV/kT)/d\ on 
Eq. (19-49), ignoring the actual dependence of the y° and w (H) on X, 
gives sN,. Hence, 


\ vX /\ 0 ,T,V,-P,W 


(19-52) 


If we regard Eq. (19-49) as a power series in Xo, the solute absolute 
activity, we will have equations analogous to those in Section 19-1 for 
an osmotic equilibrium with a single solute. Here, however, the various 
coefficients will be more complicated because they involve averaging over 
the different solute subspecies. For simplicity, let us go only as far as the 
second virial coefficient.* If we consider only those sets N containing a 
total of 0, 1, or 2 solute molecules, Eq. (19-49) becomes 

e uvikT = i + Xo y H.\’ + |f L (ff.**)(ff.'X*') 


X J v e - w "' lkT dr, dr,’+ (19-53) 


* For higher terms, see T. L. Hill, J. Chem. Phys. 23, 623, 2270 (1955). 
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where the double sum includes terms with s = s', and to,,, is the 
potential of mean force between one s-solute molecule and one s'-solute 
molecule, immersed in the outside solution. We define a solute activity 
i by 

i = X 0 ^ H,\* (19-54) 


so that a —» p(="M/V) as p —► 0. Then, just as in Eq. (19-10), 


where 


e uvikT = 1+ ^a + !fa 2 + ..., (19-55) 

z\ = V, 

L,.,< (tf,X*)(ff.<X*') [ e-""' lkT dr. dw 

z ' 2 " (£, tf . x *) 2 (19_56) 


It is necessary to digress briefly to introduce the radial distribution 
function g(r) for solute molecules at infinite dilution, p —» 0 (i.e., in the 
outside solution). As in the paragraph preceding Eq. (17-21), 

p«p,. e ~ v "'" tT dr, dr,, 


is the probability that an s-solute molecule is in dr, and an s'-solute 
molecule is in dr,., in the limit p —* 0. In this limit, Eq. (19-46) gives 


or 


tt , . g(i)X 0 X’ 

To" — -,o 

>i 


v\ 0 jy.x*', 


Pi — Xo#,'X , 

p = 'Ep<= 

i i 


(19-57) 


Therefore the probability above becomes 

p 2 (H,\ , )(H,'\’ )e~ w,, ' lkT dr, dr,- 

(E.^.X *) 2 

If we sum this expression over s and s', we have the total probability that 
any solute molecule (any subspecies) is in dr, and any other is in dr,,. 
If we regard the solute as a single component, this same probability is 
p 2 g(r) dr, dr,,, where g(r) is the solute radial distribution function. On 
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comparing the two expressions, we have 


g(r) = e-« T)lkT 


E.,.- 

(E« 


-w..'(r)lkT 
- ) 


(19-58) 


where w{r) is the potential of mean force for a pair of solute molecules 
in the outside solution, properly averaged over the different solute sub¬ 
species (Problem 19-6). Since X is proportional to the activity and ap¬ 
proximately proportional to the concentration of the A molecules, Eq. 
(19-58) contains the dependence of w(r) or g(r) on this activity or con¬ 
centration. Incidentally, in Eq. (19-58), w„, is also a function of X since 
A is a component of the solvent ( w„, depends on r, z T , X, T), but this 
will generally be a relatively unimportant effect. 

We note that e~ wMlkT in Eq. (19-58) has the formal appearance of a 
one-component (A) grand partition function (Eq. 7-27) for A molecules 
bound on a pair of solute molecules a distance r apart divided by the 
grand partition function with r = oo. This result might have been antici¬ 
pated by analogy with the following thermodynamic relations for such 
a system: 

dE= TdS + DW m + ndtN, 
d(A — ix N) = DW on (T, n constant), 

—A(m N — A) = —kT In = w(r) (T, n constant), 

where DW oa and w(r) are reversible work done on the system, the latter 
being the work (potential of mean force) necessary to bring the two solute 
molecules together from r = oo. 

We return now to Eq. (19-56), which, with the aid of Eq. (19-58), 
becomes 

Z' 2 = ^ e~ wlkT d{2). (19-59) 


This has the same form as Eq. (19-29); furthermore, w has the same 
physical significance (potential of mean force) in the two equations, 
despite the complication here of subspecies averaging. This analogy proves 
in fact to be general; that is, for Z’ N . 

From Eqs. (19-10) and (19-55), it is clear then that we can define a 
group of quantities Z' N , b’ jt fi' k , B' n (all of which depend on the properties 
of small groups of solute molecules in the outside solution) in complete 
analogy with Z* N , b*, etc. For example, Eqs. (19-35) through (19-37) 
hold here for U/kT and In 7 provided we replace by B' n (Zr, X, T) and 
w by tc(r, Zj, X, T) in these equations. 
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Next, let us derive an equation for "N/TH, the mean number of A mole¬ 
cules bound per solute molecule. Specifically, we want to express ‘N/'M 
as a power series in p. From Eq. (19-52), 



Then 

Tt/V Tf L. sHX , 1 v / db', \ y 

p TR Z. H.\‘ P \ d ln X A-A« ' 


Using the series 3(p) (Eq. 15-11) and Eqs. (15-13) relating the B' n and 6', 
we find 

H Z. *H.\‘ a (dB ' 2 \ ,2 (SB's \ 

"M Y. . H,\* V In Vr,r*,« v In X/ T,y°,u> 

(19-60) 


In the limit p —> 0, we get, as expected, the same result as Eqs. (7-29) 
and (7-51) [except that, because of the presence of the solvent, g(s)/Yj 
replaces g(s)]. The terms in p take care of the effect of solute concentration 
on the amount of binding. We can understand the linear term in p, quali¬ 
tatively, as follows. If, for example, adsorbing A molecules on a pair of 
solute molecules increases the repulsion between the solute molecules 
(i.e., dB' 2 /d ln X > 0), then, when two solute molecules are brought to¬ 
gether from r = oo, they will desorb some A molecules. But this is 
essentially what happens when the concentration of solute is increased: 
pairs of solute molecules spend more time near each other, so "N/'M decreases. 

An explicit expression for dB' 2 /d ln X in Eq. (19-60) is easily shown to 
be (Problem 19-7) 


Z. 

V(Z»H.\')* 

(ff.x*)(ff.'X*')/ M , 

x l “ 


E.,.' («+ «')(ff.x*)(ff.'X*' 
2 Z> aff.X* 



(19-61) 


where is the integral in Eq. (19-56). 

As a final topic, we consider an important special case. Suppose that 
the solute and A molecules are charged and that the potential of mean 
force between a pair of solute subspecies a distance r apart is proportional 
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to the product of the charges on the two molecules: 

= z,z,-f(r). (19-62) 


For example, we might have (Eq. 19-42) 

2 -«(r-o) 

*>" «<1 + K a ) kf - 


Let the charge number of a 0-solute be z Q and that of an A molecule be z. 
Then z, — z 0 + sz for an a-solute. A typical case would be a protein mole¬ 
cule with z = +1 (A = H + ), z 0 negative, and z m positive. We expand 
exp (— w»,./kT ) in Eq. (19-58) up to the quadratic term: 


where 


e 


—w/kT 


- i -(w) +l AwT) 



(E.ff.X*) 2 


(19-64) 

(19-65) 


These arc averages, it will be noted, in which the probabilities of s and s' 
are “unperturbed” or independent of each other [compare Eq. (13-19) 
and Problem 14-17]. We put Eq. (19-62) into Eq. (19-64) and obtain 


where 




Then from Eq. (19-66), 


- GOVW + *(z 2 ) 2 /(r) 2 -, 

~ E. H.Xz” 


(19-66) 

(19-67) 


^ = (z) 2 /(r) - i[z 2 - (z) 2 j[? + (z) 2 J/(r) 2 + • • •. (19-68) 

The leading term in Eq. (19-68) gives the potential of mean force between 
a pair of solute molecules a distance r apart which would obtain if the 
binding of A molecules on one solute molecule were uninfluenced by the 
binding on the other (i.e., z is the average charge number of an isolated 
solute molecule in the solvent). Actually, the binding on the two solute 
molecules is not independent (Problem 19-8), and we expect that the 
perturbation of one by the other will always be such as to lower the poten¬ 
tial of mean force. This is confirmed by the second term in Eq. (19-68), 
which is always negative. 

The coefficients in Eq. (19-68) can also be expressed in terms of 5 
and s 2 , where the averaging here is the same as in Eq. (19-67) [thus 5 
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is equal to ft/HI in Eq. (19-60) in the limit as p —* 0—that is, isolated 
solute molecules]: 

Z = 20 + Sz, 2 ? — z 2 = z 2 [5^ — (5) 2 ]. (19-69) 

Also, it is easy to show that (Problem 19-9) 


Consider the special case in which X (or the concentration of A) is 
chosen so that z = 0; that is, two solute molecules far from each other 
(r = oo) in the outside solution would have zero charge on the average. 
An equivalent statement is that X is chosen so that the average charge 
on a solute molecule is zero (“isoionic point”) in the limit as p —* 0. Then, 
from Eq. (19-68), 


w(r) _ _ (, z 2 ) 2 f(r ) 2 
kT 2 


(19-71) 


This equation tells us that if the two solute molecules are brought to¬ 
gether to finite r, then w will be negative. The reason for this is the follow¬ 
ing: when the (fluctuating) charge number z, on one of the two solute mole¬ 
cules happens to be, say, negative, this will increase the probability of the 
other charge number z,, being positive [to,,, will be negative and hence 
this ss' combination gets extra weight in Eq. (19-58)]. Thus there is a 
correlation between the fluctuating charges on the two solute molecules 
tending to favor charges on them of opposite sign. This is closely analogous 
to (a) the net attraction between rotating dipolar molecules in a gas 
which arises from a correlation between molecular orientations favoring 
those mutual arrangements with negative potential energy, and to (b) 
the origin of London dispersion forces (Appendix IV). 

Incidentally, for simplicity we have assumed from the outset that w„, 
depends only on the total numbers of A molecules bound, s and s', and 
not on how these molecules are distributed among solute sites. It is clear, 
however, that if the A molecules are charged, the interaction w„. will 
involve not only the total charges we have been concerned with but also 
electric moments depending on particular A distributions. In this more 
general situation, then, there will be a contribution to a negative w(r) in 
Eq. (19-71) not only from total charge correlation but also from a cor¬ 
relation between electric moments associated with the distribution of A 
molecules among available sites on the two solute molecules. 

The simplest model to which we can apply Eq. (19-71) is the following: 
each solute molecule has m independent and equivalent sites for binding 
A molecules; q is the partition function for one A molecule bound on 
one site; and 7° is the same for all s (we neglect the dependence of 7° 
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on X). Then (see Problem 7-9) 

1 wig* 


H, = 


Vy° i\(m — s)! 


3 = 


mq\ 

i + gx’ 

15 _ _2r72 _ / s \2i = , 2 x — = z 2 mq\ 
z - z [s W J - z X dX - (1 + qX)2 

z 2 m 


2 + gX + (gX) 1 


(19-72) 

(19-73) 


The form (19-73) is essentially the same as Eq. (11) of Kirkwood and 
Shumaker. 

Finally, we calculate B' 2 , for use in U/kT or In 7, from the w„, of 
Eqs. (19-62) and (19-63). Comparison of Eq. (19-66) with the expansion 
of e~ w,iT from Eq. (19-42) shows that B' 2 is given here by Eq. (19-43) 
but with z 2 replaced by (z) 2 and z* replaced by (z 2 ) 2 . In the special 
case z = 0 (“isoionic” at p = 0), B' 2 can be written 


27ra 3 _ 7r(z 2 ) 2 e 4 

3 2(€kT) 2 (l + ra) 2 »c ’ 


(19-74) 


where k refers to the outside solution. The first term is positive and 
the second is negative. At sufficiently low ionic strengths, the second 
term predominates (Problem 19-10). Equation (19-74) is the same as 
Eq. (17) of Kirkwood and Shumaker.* 


19-3 Constant pressure solution theory, f The solution theories of 
McMillan and Mayer and Kirkwood and Buff are based on the grand 
canonical ensemble and open system distribution functions. The natural 
composition variable is the concentration, since the system is at constant 
volume. These theories are formally exact and necessarily equivalent 
through suitable thermodynamic manipulations. In the present section 
we discuss an alternative, rigorous solution theory designed to yield di¬ 
rectly thermodynamic functions expressed in a particularly practical 
form. For example, for a binary solution, the chemical potentials and 
partial molal volumes, entropies, and heat contents can be developed as 
power series in the molality of the solute, with coefficients which depend 
on properties of the solvent (and small sets of solute molecules) at the 


* See S. N. Timasheff, et al., J. Am. Chem. Soc. 79 ,782 (1957) for an experi¬ 
mental confirmation. 

t See T. L. Hill, J. Am. Chem. Soc. 79, 4885 (1957); J. Chem. Phys. 30, 93 
(1959). 
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same pressure and temperature as the solution. The pressure (instead of 
the volume) is held fixed at the outset, and hence molality is a natural 
composition variable. One can substitute mole fraction for molality as 
composition variable, but we shall not discuss this possibility here. 

We restrict the treatment to binary solutions (1 = solvent, 2 = solute) 
and use a method which is closely analogous to that in the first part of 
Section 19-1. In Section 19-1, to prepare the solution we start with 
pure solvent at pi and T and add solute, holding juj and T constant (this 
is accomplished by use of a semipermeable membrane). The pressure 
changes as we add solute (the increase in pressure is n, the osmotic pres¬ 
sure). Here, on the other hand, we start with the same pure solvent but 
choose p and T as independent variables (instead of mi and T). We 
then add solute holding p and T constant. In this latter case, the solvent 
chemical potential changes. Hence the change in solvent chemical poten¬ 
tial plays a role analogous to that of II in Section 19-1. 

To obtain the desired independent variables, we use an ensemble 
apparently first introduced by Stockmayer (in a study of the relation 
between light scattering and composition fluctuations): 

r(W„ p, T, p 2 ) = e- s '“' lkT = £ V, T)e N ™ tiT , (19-75) 

N*> 0 

where 

An, = D Q(N U 1 V 2 , V, De~ pVlkT . (19-76) 

v 

The partition function An, is the p, T, N partition function already 
encountered several times [e.g., Eqs. (1-87) and (1-91)]. The partition 
function r is a “semigrand” partition function for a system, at p and T, 
open with respect to 2 but not with respect to 1. The right side of 
Eq. (19-75) is seen to be a power series in the absolute activity of the 
solute, X 2 = e‘‘* lkT , with coefficients which depend on properties of the 
solvent (Ni, p, T) containing small numbers (JV 2 ) of solute molecules. 

For convenience, we replace the absolute activity X 2 by a more practical 
activity a 2 , proportional to X 2 , but defined in such a way that (as will 
be seen below) o 2 —► m 2 as m 2 —»0, where m 2 = 27 2 /JVi. We shall 
refer to m 2 as the “molality” of the solute, though this differs from the 
conventional molality, 1000 m 2 /Wi, by a constant (TFi is the molecular 
weight of the solvent). 

The substitution of o 2 for X 2 in Eq. (19-75) gives, after dividing by 
the leading term, A 0 , 


T , i V 1 Xn n 


U9-77) 
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where 

^ AHAvA^- 1 ^ 

= Af 

(19-78) 


XjAt 

2 “ A„lVi 

(19-79) 

We note that Xi 

— Ni. The logarithm of the quotient 



A;\r A^ -1 ,-AFnlkT 

Af " e 

(19-80) 


in Eq. (19-78) has the physical significance of a Gibbs free energy change, 
as indicated, since F = —kT In A in general. In Eq. (19-80), A Fs is 
the free energy change (non-pV work done on the system by the sur¬ 
roundings) for the process 

N systems with 2Vj, N 2 = 1, p, T —► 

1 system with N it N 2 = N, p, T 

+ 

N — 1 (solvent) systems with Ni, N 2 — 0, p, T. 

We can manipulate Eq. (19-77) just as we did Eqs. (15-6) (imperfect 
gas) and (19-10) (osmotic system). We first note that 


Nini(p, T, 0) = — kTlnAo, 


where /*i(P, T, 0) is the chemical potential of the pure solvent. Then 
if we define 

Mi (p, T, m 2 ) = mi( p, T, m 2 ) - Mi (p, T, 0), 
we have (Eq. 15-7) 


where 


- ^ ( ^ ° 2) = i In i = £ *i(J>, T)aL 


kT 


N i A 0 




l!i\Tifli = X\ = N i, 6i = 1, 

2\N\6 2 = X 2 - X\, 

3!ATi0s = X 3 — 3XiX 2 ■+■ 2Xf, etc. 


From the Gibbs-Duhem equation, 


(19-81) 


(19-82) 


= m 2 , 


(19-83) 
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we have 

m 2 (p, T, a 2 ) — 2 j e i(P> T)a i 2 . (19-84) 

it i 

The inverse of Eq. (19-84), in logarithmic form (Eq. 15-20), is 

In 7' 2 (p, T, m 2 ) = - 2 h(p, T)m 2 , (19-85) 

k>i 


where y' 2 (the solute activity coefficient) = a 2 /m 2) and where 


5j = 26 2 , 

82 = 30s — 60i, 


(19-86) 


etc. Finally, if we replace a 2 by m 2 as independent variable in Eq. (19-81), 
by use of Eq. (19-85) we find 


- — = ™2 + £ C»(P, T)m n 2 , (19-87) 

n>2 

where 

Cn= ~ !L ^ «n-i. (19-88) 


Equation (19-87) is the formal equivalent of the virial expansion of an 
imperfect gas. Equations (19-85) and (19-87) give essentially the expan¬ 
sions of the two chemical potentials in powers of the molality. 

Dilute solution. In a dilute solution 7’ 2 —* 1 and a 2 —> m 2 , according 
to Eq. (19-85). Hence, from Eq. (19-79), 

M 2 = kT In + kT In m 2 (m 2 -► 0). (19-89) 

In the notation of Section 19-1, we also have 

M 2 = p\(T) H - kT In f 2 (19-90) 

= M 2 (T) + kT In k 2 (p, T)x 2 ( x 2 —* 0), (19-91) 


where f 2 = fugacity of solute, x 2 = mole fraction of solute, k 2 = Henry’s 
law constant, and p 2 = chemical potential of solute gas c.t unit fugacity. 
Since x 2 —* m 2 as m 2 —* 0, comparison of Eqs. (19-89) and (19-91) 
yields, for the Henry’s law constant, 


If we write 


2 Ai 

k 2 Ao e~" 2lkT ,-n.FikT 

Ni~ Aj ~ 6 


(19-92) 

(19-93) 
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then A F is the Gibbs free energy change for the process 
System with N it N 2 = 1, p, T —♦ 


system (solvent) with N u N 2 = 0, p, T 

+ 

1 molecule of solute in gas at f 2 = 1. 


Of course, when the solution is not dilute, f 2 = k 2 m 2 y' 2 in Eq. (19-90), 
and m 2 is replaced by m 2 y' 2 in Eq. (19-89). 

For the solvent, we have in general 

- = - In 4 > (19-94) 

kT ft 


where ft is the fugacity of the pure solvent. For a dilute solution (Raoult’s 
law) 

— ps = — In (1 — x 2 ) -*• x 2 -* m 2 , (19-95) 


in agreement with Eq. (19-87). 

Osmotic pressure. We omit series expansions (Problem 19-11) for the 
partial molal volumes, heat contents, and entropies, t and the correspond¬ 
ing extensive properties, but we consider the osmotic pressure briefly. 
Suppose we have osmotic equilibrium between the solution at p, T, m 2 
and the pure solvent at p — II, T: 

Mi(p, T, m 2 ) = m(p - n, T, 0). (19-96) 

But 

Mi(p, T, 0) - M,(p - n, T, 0) = r vtW, T) dp'. (19-97) 

/p—n 

Therefore, from Eqs. (19-87), (19-96), and (19-97), 

vm [ «’i(p', T) dp' = m 2 + J2 Cn(p, T)ml (19-98) 

kl J *~ n »>2 

This equation determines II as a function of p, T, and m 2 . If the pure 
solvent is incompressible, the left side becomes Tlvi/kT. 

Example. Consider the following simple model. The solvent is an 
inert incompressible fluid of volume Vq = N i»i, whose only role is to 


t See T. L. Hill, J. .4m. Chem. Soc. 79, 4885 (1957). 
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provide a suspension medium for solute molecules. The solute molecules 
are monatomic and interact with each other. These interactions are 
characterized by imperfect-gas type virial coefficients B\(T), B\(T), etc. 
The configuration integral for solute molecules, Z* N , can be written in 
terms of the virial coefficients as follows [Eqs. (15-13) and (15-16)]: 

Z*i = V, 

Z% = -2 VB% + F 2 , (19-99) 

Zt = —ZVB% + 12FBI 2 - 6F*B| + F 8 . 

The solution is also assumed incompressible, with volume 

F = F 0 + N 2 v 2 . 

For the pure solvent we write 

Q(Ni,0,V) =QoS(.V- Vo), 


where 5(F — Fo) is the Dirac 3-function (introduced because of incom¬ 
pressibility). In general 


Q(N U N, V) = Q°*[r ~ (Vo±N »* MM n , (ig-loo) 

N !A 3 


where Z%/N !A 3 8y is the canonical ensemble partition function of the solute 
molecules, with the Z% given by Eqs. (19-99). Then, from Eq. (19-76), 


and 


Ajy 


e- p{Vo+rrv2)lkT QoZUVo + Nv a ) 
NUlI" 


-AF N ikT _ ZHVo + ATp 2 ) 

6 “ Nl(V 0 + v 2 )» ‘ 


For example, to terms in N t 2 , 




+ 12 



- 18 


V2&2 


B$ , 1 

n + J’ 
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and 


02 = 


£2 

Vl 


Bt 

Vl 


= -C* 


03 


5 ( E?V _i_ o (BtY — 3 

2W + W 3 v\ 


1 m 

2 pf ’ 


Cz 



0*2B% B% 
v\ + v\ 


The osmotic pressure is given by 


nt>i _Mi 

kT kT’ 


(19-101) 


which can be shown without difficulty (Problem 19-12) to be equivalent to 
= P 2 + B*p\ + B*p% + • • •, (19-102) 

as expected [see Eq. (19-13)]. 

Relation to McMillan-Mayer theory. The coefficients B* of Section 19-1 
and the C» in this section are properties of the pure solvent under the 
same conditions: mi> T, and p(ji u T) or p, T, and Mi(p, T). We expect 
therefore that general relations exist between the and the C» [Eqs. 
(19-101) refer to a special case only]. From thermodynamics we findf 

2C 2 vi = 2 Bl - P§ + «i»i (19-103) 

and a similar but more complicated relation between Cz and B%, where 
P!j is the value of P 2 (partial molal volume) in the limit aa m 2 -* 0; aj 
(Problem 19-4) is the value of (dpi/dp 2 )T, H for the inside solution in 
the limit as p 2 —> 0; and v x is the volume per molecule in the pure solvent. 
In the example immediately above, a\ = — v 2 /v u P§ = v 2 , and therefore 

C2V 1 = Bt — l» 2 i 

in agreement with Eq. (19-101). 


t T. L. Hill, J. Chem. Phys. 30, 93 (1959). 
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Problems 


19-1. Find the characteristic thermodynamic function associated with the 
partition function ¥jr a in Eq. (19-2) and its relation to other thermodynamic 
functions [as in Eq. (1-73), for example]. (Page 342.) 

19-2. Replace the sum in ’Fo and ¥1 by a single term with Ni = ^1 and show 
that, for monatomic molecules, 


1 f 

f r e-^dm 


(19-104) 


[see Eq. (19-20) for notation]. Give a physical interpretation of this equation. 
(Page 343.) 

19-3. Use Eqs. (8-28), (15-5), and (15-8) to show that for, say, a pair of 
“hard” right circular cylinders, 

— 2VB* ■* J ( e WlkT — 1) si® d$i dtpi sin 82 dd 2 dtp 2 dli di 2 , 

where teisa function of ri 2 , 81 , 82 , <Pi, and <p 2 (to = +■» for cylinders overlapping 
and to ■» 0 otherwise). (Page 351.) 

19-4. It can be shownf that the inside/outside concentration ratio of a 
diffusible species A in an osmotic equilibrium is given by 

j| = I + 611 (*)/>+•••, (19-105) 


where 611 (A) is a cluster integral of the form (15-52) involving the potential of 
mean force toix(A), in the outside solution, between one solute molecule and one 
molecule of species A. (a) If the solute ions are spheres of diameter a and total 
surface chargeze, if all other ions are treated as point charges (see Problem 18-10), 
and if an ion of species A has a charge z*e, then ten (A) is given by Eq. (19-42) 
with a replaced by a/2 and z 2 by zz*. Show that 


611 (A) 


tra 3 zz* , z 2 z*a 
6 £ + 4[l + (*«/2)]2£' 


(b) If the solute binds A ions, show that 


611 (A) = 



zzk 1 _ z 2 zja 

E + 4[l+(«a/2)P£’ 


where the averages are defined by Eq. (19-67). (c) Show that in a two-com¬ 
ponent osmotic system, Eq. (19-105) leads to <*1 = bu/vi, where ai is the quan¬ 
tity appearing in Eq. (19-103). (Pages 352 and 368.) 


t T. L. Hill, /. Am. Chem. Soc. 80, 2923 (1958). 
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19-5. Show that the term in z 2 in Eq. (19-41) is a consequence of (a) the 
condition of electrical neutrality in the outside solution, and (b) w# = z<zy/(r) 
for an ij pair of ions in this solution, where/(r) is arbitrary. (Page 353.) 

19-6. Show that the w(r) defined by Eq. (19-58) is the potential of the mean 
force between two solute molecules if «o u ,(r) is the potential of mean force 
between an s-solute and an s'-solute. (Page 358.) 

19-7. Show that (19-61) is equal to dB' 2 /d In X. (Page 359.) 

19-8. In the special case (19-62) through (19-70), 3 is the mean value of < 
for a pair of solute molecules separated by r = « in the outside solution, 
(a) Let s be the mean value of s for a finite separation r. Show that 

3 = 3 — zz(*5 — (3)*]/(r) H- 

and give a physical interpretation of the result, (b) Let 3 be the mean value of 
« on one solute molecule of a pair if the other solute molecule is at r and has a 
fixed charge number z t ». Show that 

1 = 3- zz„fP - (J) 2 ]^) + • •, 

and interpret the result. (Page 360.) 

19-9. Derive Eq. (19-70) for P — 3* from «" = 

(Page 361.) 

19-10. In Eq. (19-74), put o = 60 A, « = 16, T = 298.1°K, e = 78.5, 
and determine the approximate concentration in moles-liter -1 of a 1-1 elec¬ 
trolyte (outside solution) at which the two terms making up Bj have the same 
magnitude. (Page 362.) 

19-11. Obtain expansions in powers of ms for ?i, Hi and 3i in a binary 
solution at p, T, ms. (Page 366.) 

19-12. Verify Eq. (19-102) for TL/kT. (Page 368.) 

19-13. Discuss the problem of an isomeric chemical equilibrium between two 
solutes in a very dilute solution at p and T (solutes = A, B; solvent = 1; use 
mA = Na/N i and m* = Nb/Ni as composition variables). 

19-14. Find an expression for kf, the Henry’s law constant, in a dilute binary 
solution (jS = solute, a = solvent), from Eq. (17-21). 

19-15. Translate Eqs. (15-40) through (15-49) into corresponding equations 
for a binary solute in the McMillan-Mayer theory. 

19-16. Show, from the general B£ referred to just preceding Eq. (19-74), that 

SB's zz{? - (3) 2 ] az?[z(7 - ?3) + 2z 0 (? - (3)*)] 

dlnX " Z 4(l + xo) 2 2: 


Supplementary Reading 

McMillan, W. G., and Mayeb, J. E., J. Chem. Phyt. 13, 276 (1945). 
S. M., Chapter 6. 
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CHAPTER 20 

THEORY OF CONCENTRATED SOLUTIONS 

The methods employed in the previous chapter made possible a rigorous 
formulation of the theory of dilute solutions. But these methods cannot 
be extended in a practical way to concentrated solutions. If a phase sep¬ 
aration occurs in the solution at higher concentrations, the methods of 
Chapter 19 are not applicable even in principle beyond the concentration 
of phase separation (because of divergence of the series expansions). In 
view of the fact that we are interested in the entire concentration range 
here, approximate theories must be introduced. 

This situation is completely analogous to that in gases and liquids: 
the imperfect-gas theory of Chapter 15 had to be discarded in treating the 
theory of liquids (Chapter 16). 

It is particularly appropriate to give only a brief introductory treatment 
in this chapter because of the existence of the two recent and excellent 
monographs on this subject by Guggenheim and by Prigogine (see Sup¬ 
plementary Reading list). In particular, these books should be consulted 
for detailed comparisons between theory and experiment. Section 20-1 on 
the lattice (or “strictly regular”) theory of solutions may serve as an in¬ 
troduction to Guggenheim’s book, and Sections 20-2 to 20-4 are related 
in the same way to Prigogine’s book. 

We study only binary solutions in Sections 20-1 to 20-3. But the meth¬ 
ods discussed can all be extended to any number of components. 

20-1 Lattice theory of solutions. In this section we consider a model 
of a binary solution which is sufficiently idealized so that we can take over 
the results of Chapter 14 on lattice statistics with only notational changes. 
The model is more nearly appropriate for a solid solution, but it is usually 
applied to liquid solutions. 

The system is a condensed, incompressible solution containing Na and 
N b molecules of the two components at temperature T. The molecules 
occupy sites of a regular lattice; there are no vacant sites. Each site has 
c nearest-neighbor sites. The lattice is rigid, that is, it has a fixed volume 
per site. The volume V is then not an independent thermodynamic vari¬ 
able, since it is simply proportional to Na + Nb = M (total number of 
sites). Hence, this model omits all p-V effects. The molecules are spherical 
or effectively spherical and the two species are of approximately the same 
size (otherwise they would not be interchangeable on the same lattice). 
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Each molecule vibrates about a lattice site with a (three-dimensional) 
partition function Qa{T) or ^(T 7 ), independent of the state of occupation 
of neighboring sites. If the molecules are polyatomic, Qa and qs include 
the rotational and internal vibrational degrees of freedom. We take into 
account nearest-neighbor interactions: the pair interaction energies are 
denoted by waa, wab, and w B b- That is, waa — waa(« 0, etc., where a 
is the nearest-neighbor distance. We treat the tc’s as constants, though 
we could consider them functions of temperature. Usually all the w’a have 
negative values. 

The thermodynamic equations for the Helmholtz free energy are 

dA = -SdT + ma dN a + mb dN B , (20-1) 

A = haNa + HbNb- (20-2) 

If we rewrite Eq. (20-1) as 

dA = -SdT + mb d(N A + N b ) + (ma - mb) dN A , (20-3) 

we have the analog, term by term, of the equation 

dA = -SdT - 4> dM + m dN (2CH1) 

for a lattice gas, where we have made the arbitrary association: occupied 
site <-» species A. 

The canonical ensemble partition function for the solution is 

Q(Na, Nb, T) = qA(T) NA q B (T) NB £ a, Na + N B , N AB )e~ wlkT , 

Nab (20-5) 

where 

W = N A AW A A + NaBWaB + NbbWbB- 

The notation is essentially the same as that in Eq. (14-20); the function 
g is that of Chapter 14. We introduce [see Eqs. (14-19), (14-27), and 
(14-28)] 

w = waa + w B b ~ 2wab, ( 20 - 6 ) 

cN a = 2N A a + Nab, (20-7) 

cNb = 2 Nbb + Nab, ( 20 - 8 ) 

into Eq. (20-5) and obtain 

Q = (qAe- CWAAl2kT ) NA (q B e- CWBBl2kT ) f,B X) Q^ AB , (20-9) 

Nab 
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where x = e wl2kT . The sum is now exactly the sum which occurs in lat¬ 
tice-gas theory [see, for example, Eq. (14-29)]; we denote it by 52 (a func¬ 
tion of Na, M, and T), below. 

Incidentally, we note that in this model pure A [put Nb — 0 in Eq. 
(20-9)] has the partition function 

Q(N a , D = (q A e- cw ^ mT )^, 


as in an Einstein crystal. If the solution is supposed to be a liquid, then 
q A should include a communal entropy factor e [see Eq. (16-22) and Sec¬ 
tion 20-2]. The interaction potential energy in pure A, relative to infinite 
separation as zero, is seen to be cw AA N A /2, as expected. The chemical 
potential /x A and vapor pressure p\ of pure A are 


M a _ 
kT ~ 


/d]nQ'\ _i Qaie -cw AA i 2 kT 
\ dN A ) T U 


MX (gas) _ ix a (T) 
kT ~ kT 


Inp®, 


( 20 - 10 ) 


assuming the vapor is an ideal gas. There is, of course, a similar equation 
for component B. 

Ideal solution. The first special case of Eq. (20-9) that we consider is 
w = 0; that is, w AA + wbb = 2«us. This means that, energetically, 
A and B molecules “like” the opposite species as well as their own species; 
or, that there is no energy change when an A molecule which is completely 
surrounded (nearest neighbors) by A molecules and a B which is sur¬ 
rounded by B's exchange places with each other. In Eq. (20-9), x = 1 
and, as in Eq. (14r-4), 

L (N a -f- Nb) 1 (20-11) 

N a \Nb\ 

This corresponds to the ideal lattice statistics of Chapter 7. We can now 
put Eq. (20-11) for 2 in Eq. (20-9) and find the thermodynamic func¬ 
tions of interest. For example, 


Mx (gas) _ mx(T) 
kT ~ kT 


+ In p A , 


where x A (not to be confused with x) is the mole fraction N A /(N A + Nb) 
of A in the solution, and where p A is the partial pressure of A above the 
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Fig. 20-1. Vapor pressure curves lor Bragg-Williams binary solution. 


solution. Using Eq. (20-10), 



(20-12) 


Thus with this model, Raoult’s law is obeyed over the whole composition 
range (Fig. 20-1) when w — 0. It should be recalled that, besides w = 0, 
we are also imposing the implicit condition that A and B molecules be 
near enough in size to'fit into the same lattice. In thermodynamics* a 
solution exhibiting the behavior (20-12) is called “ideal.” 

Component B also follows Raoult’s law, of course. This is, in fact, a 
purely thermodynamic consequence of Eq. (20-12) for A (Problem 20-1). 

From the standard canonical ensemble equations (Section 1-4), we 
find for the entropy 

S = N A (tT + Hn «) + (kT -h Mil?.) 

The last term arises from the configurational degeneracy. If we define 
A S m (“entropy of mixing”) as the entropy change in the process 

Na molecules 
Nb molecules 


of pure A at solution with 
of pure B at T) Na ’ Nb ’ T ' 
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Fig. 20-2. Entropy of mixing in an ideal solution as a function of mole 
fraction of component B. 


then 


A£n 

Mk 


—xa In xa — xb In xb > 0. 


(20-14) 


In ordinary solution thermodynamics, “mixing” is defined at constant 
p and T, but we need not specify p above because of the absence of p-V 
effects in this model. The function (20-14) is plotted in Fig. 20-2. We 
find also 


A A m _AS* 

MkT ~ Mk ' 


(20-15) 


AE m = 0. 


(20-16) 


Bragg-WUliama approximation. We recall that in this approximation 
(Section 14-4) the molecules are assumed to be distributed among sites 
in a random fashion, despite molecular interactions. Equation (20-9) 
becomes in this case 

Q = (qAe- cv ’ AAl2kT f A {( l Be- tv ’ BalikT f B ’ (20-17) 

where Uab = cNaNb/M. Then 


Pa _(3 In q\ 

kT \dN a /n b .t 


= In xa -\nqAe- eKAAl2kT 


cw( 1 — xa) 2 
2 kT 
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Fio. 20-3. Critical vapor pressure curves for Bragg-Williams binary solution, 
and therefore 

£4 = 1 -**>*' 2kT , (20-18) 

Pa 

with an analogous equation for pb/p°b (Problem 20-1). We plot Eq. 
(20-18) in Figs. 20-1, 20-3, and 20-4 for various choices of cw/kT. 

In Fig. 20-1, the value cw/kT — 0 leads to ideal behavior, as already 
mentioned. If we take w negative, say cw/kT — —2, this means, accord¬ 
ing to Eq. (20-6), that Waa + wbb is more negative than 2 wab (AA 
and BB pairs are more stable, energetically, than AB pairs). This gives 
so-called “positive” deviations from Raoult’s law, as shown in the figure: 
the presence of B molecules increases the “escaping tendency” of A mole¬ 
cules, and vice versa, compared to ideal behavior, because of the “dislike” 
of A and B for each other relative to A for A and B for B. When w is 
positive, we have, on the other hand, “negative” deviations from ideality 
(Fig. 20-1, cw/kT = 4). 

We found that cw/kT = — 4 leads to critical behavior in a lattice gas 
(Section 14-4). The same is true here, as should be expected, and as can 
be seen in Fig. 20-3. When cw/kT < —4, the t\vo components are no 
longer miscible in all proportions; over part of the composition range we 
get a separation of the solution into two phases of different composition 
(Fig. 20-4). In one phase the majority of molecules are of type A and in 
the other, of type B. We can understand this as follows: if A A and BB 
interactions are sufficiently favored over AB interactions (or if the tem¬ 
perature is low enough), the system splits into two solutions with excess A 
in one and excess B in the other in order to have the advantage of a larger 
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Fio. 20-4. Vapor pressure curves for Bragg-Williams binary solution showing 
phase separation. 

number of AA and BB interactions. The location of the horizontal line 
(stable equilibrium curve) is fixed in Fig. 20-4 by the thermodynamic 
requirement that the chemical potential (or vapor pressure) of each com¬ 
ponent be the same, at equilibrium, in the two solutions of different 
composition. 

It will already be apparent to the reader that this very simple (lattice, 
Bragg-Williama) theory of solutions predicts correctly some of the most 
important qualitative features observed experimentally with binary solu¬ 
tions. Of course, the theory cannot be expected to be satisfactory in a 
quantitative way. 

We can also easily derive from the Bragg-Williaras Q (Eq. 20-17) the 
following “mixing” properties (Problem 20-2): 

= xa In x A + x B In x B - ^ x A x B , (20-19) 

^ = -xa hi x A — x B In x B , (20-20) 

AF m _ cw /on-on 

MkT ~ 2kT A B ' (20-21) 

The entropy of mixing is the same as for an idfeal solution. This is a con- 
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XB 



Fig. 20-5. Free energy of mixing for Bragg-Williams binary solution at 
temperatures above and below the critical temperature ( cw/kT, — —4). 

sequence of the “random distribution” assumption in the Bragg-Williams 
theory. If w is negative (AA, BB more stable than AB), the energy of 
mixing is positive, as expected. The free energy of mixing is plotted in 
Fig. 20-5 for cw/kT = —3 and cw/kT = —4.5 (phase separation). The 
horizontal line (mixture of two solutions of different composition) is the 
stable equilibrium curve. This curve has a lower free energy than the 
dashed metastable curve. 

It is useful for many purposes in the study of solutions to define “excess” 
quantities relative to the values the quantities would have if the pure 
components at a given pressure and temperature formed an ideal solution 
(at the same pressure and temperature). For example, for the entropy, 

AS, = S — S (ideal) = AS m — AS m (ideal). (20-22) 
Thus, in the Bragg-Williams approximation, 

Wkf = ~2kf XaXb ’ (20-23) 

AS, = 0, (20-24) 

AE, = AA,. (20-25) 

In experimental work it is in general found that AE, and T AS, are of 

the same order of magnitude. The fact that the present approximation 
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predicts AS, = 0 is therefore a serious fault. In the next subsection we 
find that higher approximations (lattice model) do not improve this 
situation appreciably. 

Other criticisms of the lattice model, which, incidentally, are also in¬ 
dependent of the order of approximation used in treating the model, are 
the omission of p-V effects and the predicted symmetry in various proper¬ 
ties about Xa = £, which is not always observed experimentally. 

Quasi-chemical approximation. It is somewhat more troublesome to 
rederive binary solution equations in this approximation, so instead we 
take over results from Section 14-5 (lattice gas in the same approximation). 
To do this we take advantage of the fact that Q gas in Eq. (14-29) and 
Qsoiution in Eq. (20-9) contain the same sum £. From Eqs. (20-3), (20-4), 
and (14-62) we have 


MB _ / aln&pA _, -tx, BB l2kT _ ± 

kT \ dM J Na . t ~ qB k7 


dM / n a ,t 
- In q B e~ ev>BBl2kT + In | In » ■ ;£ 


where 

Then 


/3 = [1 — Axax b {1 — e 


0 

-w/iT^l/2 


kT 

)xb 

+ 2xb 


PB 

Pb 


~x - 1 

~ XB r(F+ 


- 1 + 2 xb ] cI2 

+ l)z* J 


(20-26) 


(20-27) 


Equations for pa and pa may now be written down by symmetry consid¬ 
erations. The qualitative behavior of the vapor pressure equations is 
the same as in Figs. 20-1, 20-3, and 20-4. The critical temperature is 
given by Eq. (14-63). 

The free energy is 

A _ xaPa i xbpb 
MkT ~ kT ~ 1 ' kT 


From Eq. (20-26) and its counterpart for component A, we have then 


A A, 
MkT 


= | xa In 


0 


1 + 2 x a , .. 0 - 1 + 2 x B 

- + XBlD (0 -r 1 )*b 


(0 + l)*^t 




(20-28) 


The total nearest-neighbor (“configurational”) interaction energy, 
F 0 onfi*, is, from Eqs. (20-6) through (20-8), 

Eeonftt = NaaWAA + NaBWaB + N%bW B B 

Njlbw , cwaaNa , cwbbNb 
=- 2 ~ +-2-+-2-’ 
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where 


Then 


N* ab 

cM 


AE m 


A.E. 


MkT MkT 


2 xax b 
fi +1 

NabW CWXaXB 

2MkT ~ kT(p + 1)' 

A A, then follows from Eqs. (20-28) 


(20-29) 


The expression for T AS, = A E, 
and (20-29). 

The significance of the results for the excess functions can best be seen 
by converting them into (high-temperature) power series in cw/kT. We 
find (Problem 20-3) 


A A, 
MkT 

AE, 
MkT 

AS. 

Mk 


xaxb 

2 

XAXB 


AE. A A, 
MkT MkT 


(_ x * x b (cw \ 2 , 

\kTj 4c \kTj 

/ cw\ _ xWb (cw \ 2 4 
\kTj 2c \kT/ 

_ xax I ( cw\ 

4c \kT/ 


(20-30) 

(20-31) 

(20-32) 


Actually these series are exact, for a lattice model, as far as they go (in 
fact the quasi-chemical equations also give the exact cubic termsf). The 
linear terms are just the Bragg- Williams results (see Problem 14-17); they 
are associated with random mixing (T —* oo). The quadratic terms are 
the first corrections for nonrandom mixing. Note that AS, is always 
negative, as should be expected in this model from any reduction in ran¬ 
domness. But experimentally, AS, is sometimes positive. 

To get an idea of the orders of magnitudes involved in the above equa¬ 
tions, let us take a numerical example: xa = xb = 0.5, c = 10 (a good 
average for a liquid), and cw/kT = —4 (a rather low temperature). Then 

A A, 1 1 A£„ 1 1 AS„ 1 

MkT 2 40 ’ MkT 2 20 ’ MkT 40' 


We see that the correction for nonrandomness is very small. Incidentally, 
the ideal mixing terms that have been subtracted out here [Eqs. (20-14) 
and (20-15)] are In 2 = 0.69 for the entropy and —0.69 for the free 
energy. The main conclusion we reach is that improvement of the Bragg- 
Williams approximation does not provide an excess entropy term T AS, 
of the same magnitude as A E„ as found experimentally. This difficulty, 


t See Guggenheim, pp. 62-70. 
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as well as the fact that AS„ is always negative, is therefore a property of 
the lattice model itself and not of approximations introduced in deriving 
thermodynamic functions from the model. 

However, it should not be concluded from the above discussion that 
nonrandom mixing corrections are never important. For example, the 
location of the critical point is quite sensitive to such corrections [see 
Eq. (14-63), et seq.]. 

Because of the small correction found for A S t in Eq. (20-32), the ap¬ 
proximation of random mixing can appropriately be introduced, for mole¬ 
cules of like size, in lattice or cell solution theories that are otherwise 
fairly sophisticated (e.g., Section 20-2). Random mixing should not be 
assumed in such theories, however, if the molecules are very different in 
size (say have more than a 25% difference in diameters). 

Solid solutions and alloys. The lattice theory of solutions has been ap¬ 
plied recently by Halsey et al., in considerable detail, to solid binary solu¬ 
tions of the rare gases.* One would expect that this kind of system would 
put the lattice theory in the best possible light. But, at least for argon- 
krypton solutions, the conclusion is reached that the experimental results 
cannot be explained by the present lattice model. We shall not attempt a 
comparison between experiment and the above equations because w was 
treated in Halsey’s work as temperature-dependent. Negative values of 
w and critical solution phenomena are observed. 

Another example of the application of lattice theory to binary solid 
solutions is found in the so-called order-disorder transition! in alloys 
(e.g., |8 brass, an alloy of copper and zinc). In this case w is positive and 
phase separation does not occur. However, a higher-order transition 
(“order-disorder”) is observed, for example, in the heat capacity as a 
function of temperature at constant composition (say xa = 0.5). The 
experimental heat capacity curves resemble the square lattice curve in 
Fig. 14-3. Qualitatively, the nature of the transition is the following. 
Suppose the lattice is simple cubic and x\ = 0.5, for ease of discussion. 
At low temperatures, because of the greater energetic stability of AB 
pairs relative to A A and BB, the molecules will tend to arrange them¬ 
selves alternately in the lattice (• • • ABAB • • •), as for example in a 
crystal of NaCl. There is also a second and equivalent way to do this 
(• • • BABA ■ • •). In either case there is long-range configurational order of 
the type found in crystals but not in liquids (or in a solid solution when w 


* J. F. Welling and G. D. Halsey, Jr., J. Chem. Phys. 30, 1514 (1959); 
J. Phys. Chem. 62, 752 (1958); J. H. Singleton and G. D. Halsey, Jr., J. 
Phys. Chem. 58, 1011 (1954); M. Freeman and G. D. Halsey, Jr., /. Phys. 
Chem. 60 ,1119 (1956). 
t See Guggenheim, Chapter 7, for details. 
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is negative). The extent of long-range order varies with temperature and 
completely disappears above a critical temperature T e . Hence the term 
“order-disorder transition.” It can be shown rigorously, by a symmetry 
argument,* that the heat capacity as a function of temperature, of a lat¬ 
tice model with x A = 0.5, is the same for a given magnitude of w, whether 
w is negative (phase separation below T c : • ■ • AAA ■ • • BBB • • •) or 
positive (long-range order below T e : • • • ABAB • • •). See, for example, 
Fig. 14-3. It follows from this that the temperature T c of the transition 
is the same for the two systems (id positive or negative). For a square 
lattice, e ~ wl2kTc — \/2 — 1 exactly (Section 14-3), but the exact 
theoretical critical temperature is not known for any three-dimensional 
lattice. 

Experimental heat capacity curves for binary solutions with negative 
w, at fixed (critical) composition, have been obtained recently, t They 
resemble Fig. 14-3, as expected. 

20-2 Cell theories of binary solutions. We saw in the preceding section 
that our earlier study (Chapter 14) of lattice problems could easily be ap¬ 
plied to binary solutions. Our object here is very similar: to show how the 
UD cell theory of liquids (Section 16-2) can be extended to include binary 
solutions. 

Our starting point is a modification of Eq. (20-5). We assume that the 
molecules are similar in size and that we have a lattice with every site 
(cell) occupied by an A or a B. The nearest-neighbor number is c and the 
nearest-neighbor distance is a. In view of our findings in Section 20-1, 
we assume further that we have a random distribution of molecules among 
sites. Then we write 

0 - - jQyvr <a>-*» 

where W is the average configurational potential energy, calculated on a 
random-distribution basis, with each molecule at the center of its cell. 
That is, 

JP = TVaaWaa + T^abWab + NbbWbb 

= (xaWaa + 2x a xbWab + xbWbb). (20-34) 
So far we seem to have just Eq. (20-17) for the Bragg-Williams lattice 


* See S. M., Chapter 7. 

fG. Jura, D. Fraga, G. Maki, and J. H. Hildebrand, Proc. Nat. Acad. 
Set. 39, 19 (1953). 
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theory. But now we introduce a refinement, which brings in the UD 
theory. We evaluate qa and qs explicitly by a cell calculation of the UD 
type, assuming that the c neighbors of a given molecule, smeared uni¬ 
formly over the surface of a sphere of radius a, consist of cxa A molecules 
and cxb B molecules (again using the random distribution assumption). 
Thus Qa and Qb are now not only functions of T but also of t> = V/M 
and xa (or xb). Unlike the lattice theory, the present treatment does not 
make the solution incompressible, and V is an independent thermodynamic 
variable as it is in real solutions. V is related to a by Eq. (16-24): a 3 = 
7 v, where ordinarily we take 7 = \/2 (and c = 12). The thermodynamic 
equation for the Helmholtz free energy is 

dA — — S dT — p dV /xa dNA -(■ Mb dN b, (20-35) 

A = -kT\n Q. 

Let us now begin to change notation to conform with Section 16-2. 
For each type of pair interaction, we use the Lennard-Jones potential 
(16-28) with parameters €aa, **aa, «ab, t*ab, etc. Incidentally, in numerical 
calculations, in the absence of other information, one usually takes (Eq. 
15-53) 

«AB = (<AA«BB) 1/2 , 

_* TAA + T%b 

Tar = -=-- 


We assume A and B are both monatomic, so that 




(20-36) 


as in Eq. (16-22). Polyatomic molecules with effective spherical sym¬ 
metry can be treated simply by changing the definition of A (T) (Problem 
20-4). The factor e is included, rather arbitrarily, for a liquid (see Section 
16-2). Actually, this factor or any other constant factor will have no ef¬ 
fect on the equation of state, on excess or mixing properties, or on Pa/p°a, 
etc. The free volume for an A molecule is (Eq. 16-25) 

vf{v, x A , T) = J e~* AlkT 4vr 2 dr, (20-37) 

'l 'a = ?a (r) — pa(0), 


where vaW is the potential energy of interaction of an A molecule in its 
cell at r with its cxa A neighbors and cxb B neighbors. Instead of 
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Eq. (16-27), we have 

««-§ L l xaWaa(R) + xbUab(R)] sin 6 d 6 . (20-38) 


On writing out xauaa + xb^ab in detail, using the Lennard-Jones po¬ 
tential, we see that this quantity is equivalent to a single potential ua(R) 
with average parameters e A and r\, which are functions of composition, 
determined by 

(AfA 6 = Xa^AA^AA + %B*AB r ABi 


>•-*12 „ - -* 121 - - -*12 
*A*A = %A*AATaA + XbCabTAB • 


(20-39) 


These equations can easily be solved for e A and r*f. Thus ^a, <pa( 0), and 
vf depend in exactly the same way on (a and v* A = r^/y as do 4 ', <p(0), 
and Vf on e and v* in Eqs. (16-34) through (16-36). For example, if 
g(v*/v, e/kT) is the function in Eq. (16-36), then 

vf = 2ira 3 g(v\/v, e A /kT), 


where v\ and are functions of xa- Completely analogous results are 
of course found for vf, etc. This means that numerical tables computed 
for the one-component liquid LJD theory can be employed here without 
alteration. 

For the interaction potential energy in the LJD notation, we have 


jp _ Na<pa( 0) Nb<pb{ 0) ( 

2 2 

where, from Eq. (20-38), 

¥>a(0) = c[x A UAA(a) + XbUab(o)], 


<pb( 0) = c[xau A b(o) + XBUBBia)]- 


(20-40) 


It is easy to see that Eqs. (20-34) and (20-40) are the same, since w A a — 
«aa(o), etc. 

The random-distribution assumption can be pushed one step further 
in this theory, to simplify matters a little more. Here we consider all cells 
as equivalent (as they are, on the average) and use a single free volume ty 
based on a potential energy that is averaged (random distribution) not 
only over the two kinds of nearest neighbors but also over the two kinds of 
occupation of the cell itself (A or B). Thus, in Eqs. (20-33) and (20-36), 
instead of (vf) lfA (vf) NB , we have vf, where 

Vf = j e~* lkT iirr 2 dr, 

<!> = < p ( r) — *>(0), 


(20-41) 
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and 

<p(r) = | J [xaUaa(R) + 2xax b uab(R) + x%ubb(R)] sin 6 dO. (20-42) 

If we replace the expression in square brackets by a single Lennard-Jones 
potential u(R) with average parameters e and r* (functions of composi¬ 
tion), we find that « and r* are determined by the equations 

er*® = Xa tAAXA^A + 2xaXb€abXAB + X%fBBT%% 

(20-43) 

er* 12 = X 2 AtAAT*A l A + 2 x A XBlABT*AB + XB€BBr%B, 


which are again easy to solve. In this case, then, we get the UD functions 
(only one set this time) #>(0), and ty in the same notation as in Section 
16-2, but we have to keep in mind the fact that e and r* (or v*) are func¬ 
tions of composition. 

The interaction potential energy is 

W = , (20-44) 

where, from Eq. (20-42), 


¥>(0) = c[xaUaa(o) + 2x A x B u A B(a) + x | ubb ( o )]. 

This is the same W as in Eq. (20-40). 

From Eqs. (20-33), (20-36), and (20-44), the canonical ensemble parti¬ 
tion function in the single free-volume approximation is 


n (v/e) M (N A + N B )le- M « W2kT 
W ” Aj*5*A V'NaWbI 


(20-45) 


or 

= x A In + x B In A| — In v t e + x A In x A + x B In x B + * 

(20-46) 


where »/ is a function of v, T, and xa, and ?(0) is a function of v and 
xa- The equation of state is found from 


-V = 


'dd.\ 

3V/t,n a .n b 



(20-47) 


Since ty and <p( 0) are the same functions as in the UD theory [except for 
the dependence of € and v* on x a, which, however is constant in the dif- 



386 


THEORY OF CONCENTRATED SOLUTIONS 


[CHAP. 20 


ferentiation (20-47)], we get the same equation of state (but with € and 
v* functions of x A ), (16-37). We are usually interested in solution be¬ 
havior at low and constant pressure. In computational work, p — 0 
is often chosen, since for a condensed phase most properties are relatively 
insensitive to p. If we choose T and take p = 0 (or p = any constant), 
Eq. (16-37) then provides v = V/M as a function of x A . 

Now A/MkT in Eq. (20-46) is a function of v, T, and x A . If we want 
p, T, and x A as independent variables, v can be determined as a function 
of these variables from the equation of state, (16-37). Therefore we may 
consider v in Eq. (20-46) to be replaced by v(p, T, x A ), so that A/MkT 
becomes a function of p, T, and x A . 

We know from thermodynamics that for an ideal solution (defined by 
stating that the fugacities f A and }b obey Raoult’s law), the quantities 
AF m , AE m , and A# m are all zero, for mixing at constant p and T, while 
AS m is given by Eq. (20-14). These properties require, in the present 
theory, that for an ideal solution 

'AfJfcT = XjL ^ A* + xb In A| — x A In v/(x A = l)e — xb In v/(x B = l)e 
+ ZA In + ** In xb + * MOt * A = 1} ± X ^?- B = 1} • (20-48) 


The quantities labeled x A = 1 and xb = 1 refer to pure liquid A and B, 
and the independent variables are p, T, and x A , as explained above. Hence 
v is in general different in the terms labeled x A = 1 and xb = 1 (i.e., 
pure A and B will usually have different molar volumes at the same 
pressure). Equation (20-48) necessarily gives 


A A m (ideal) 
MkT 


x A In x A + xb In xb, 


as required. From Eqs. (20-46) and (20-48), we have for the excess Helm¬ 
holtz free energy, 

Aj4 

= x A In v f (x A = 1) + x B In v/(x B = 1) - In vj(x A ) 

-f <p( Q ' x a) ~ x A <p(0, x A = 1) — x B <p(0, *b = 1) f (20-49) 
2 kT 

where p, T, and x A are independent variables and v is in general dif¬ 
ferent in the terms labeled x A = 1, x B = 1, and x A . All terms in 
Eq. (20-49) contribute to A E e , but only the v / terms contribute to AS, 
(Problem 20-5). The <p terms are essentially equivalent to Eq. (20-23) 
in the Bragg-Williams lattice theory (Problem 20-6), but are not exactly 
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the same because the nearest-neighbor distance a is different in pure A, 
pure B, and in the solution at the same pressure. It is clear from the form 
of the Vf terms that AS, might be positive or negative. 

We shall not pursue this subject further in any detail. It is sufficient to 
say that Salsburg and Kirkwood* have made detailed calculations based 
on Eq. (20-45), but with the added refinement of including three shells of 
neighbors in the calculations instead of only one. These authors have 
compared their results with a number of experimental systems. A very 
considerable improvement over the lattice theory is found, and complete 
qualitative agreement with experiment is achieved. In particular, AS, 
is calculated to be of the correct order of magnitude, though always some¬ 
what too small. The theory gives the correct sign for AS, (usually positive), 
unlike the lattice theory which always has AS, < 0. Both AV, and AH, 
are in general too large (but the lattice theory does not even consider 
the volume: AV, = 0 always). Usually experimental values of AH, and 
AV, have the same sign. We shall see that first-order conformal solution 
theory (Section 20-4) predicts that AH, and AV, always have the same 
sign. Recent experimental cases in which these two quantities have op¬ 
posite signs have been observed (e.g., neopentane and carbon tetrachloride). 
The cell theory under discussion here correctly predicts this sign reversal 
in such cases. 

20-3 Random-mixing, corresponding-states theory. The treatment in 
the preceding section has one serious source of error, which is easy to rem¬ 
edy: it is based on LJD theoretical functions for a pure liquid. If we set 
for ourselves the less ambitious goal of predicting properties of the solution 
from those of the pure liquids rather than from first principles, then instead 
of using LJD functions for the pure liquid state, we may use experimental 
(corresponding-states) functions. In this section we show briefly how this 
can be done. Otherwise we employ the methods (random mixing, average 
e and r*) of Section 20-2. As we should expect, since experimental in¬ 
formation is built into the “theory,” agreement with experimental data 
on solutions is better, quantitatively, than with use of the UD theory, t 

The method of this section was suggested independently by Brown, t 
Prigogine et al.,t and Scott, t 


* Z. W. Salsburg and J. G. Kirkwood, J. Chem. Phys. 20, 1538 (1952); 21, 
2169 (1953). See also I. Prigogine and G. Garikian, Physica 16, 239 (1950); 
I. Prigogine and V. Mathot, J. Chem. Phys. 20, 49 (1952); J. A. Pople, Trans. 
Faraday Soc. 49, 591 (1953). 
t See Prigogine, Chapter 11, for a detailed summary, 
t W. B. Brown, Phil. Trans. Roy. Soc. (London ) 250, 175, 221 (1957); I. 
Prigogine, A. Bellemans, and A. Englert-Chwoles, J. Chem. Phys. 24, 518 
(1956); R. L. Scott, J. Chem. Phys. 25, 193 (1956). 
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We recall [Eqs. (20-33) through (20-40)] that in extending the LJD 
theory to mixtures we passed from the one-component equation 


Q 


_ ^ v/ee 




A» 


Y 


/ 


to the two-component equation 


Q 


= (vfee- ,A(0)l2kT Y A ( v?ee-** mi2kT \ 
' ' ' A^ f 


f ee -* B mi2kTy B (Na + Nb)[ 


NaW b \ 


(20-50) 


(20-51) 


where vf and <pa( 0) depend on the average parameters e A and r\ deter¬ 
mined by Eqs. (20-39). Here the procedure is identical except that the 
starting point is the experimental corresponding-states equation (16-45). 
That is, the one-component equation is, instead of (20-50), 




*H(e/kT, ti/r* 3 ) 
A« 


r. 


and the two-component equation is assumed to be 


r\M^/kT, v/r?) Y* \ r* B a *(e B /kT,v/r% 3 ) ] N * (N A + N B )\ 

A a A J L A% J N A \N B \ ’ 

(20-52) 


where «x, r*, e B , and r B are average values still determined by Eqs. 
(20-39). These equations refer specifically to monatomic molecules, but 
if we change the usual definition of A (T) (see Problem 20-4), they also 
apply to polyatomic molecules (with effective spherical symmetry) that 
obey the law of corresponding states. 

Just as we used a single V/ and #>(0) in Eq. (20-45), we can simplify 
(20-52) by writing 


[r* 3 V(e/kT, v/r* 3 )] M (N A + N B )\ 
A ™a a 3/b Na \n b \ 


(20-53) 


where « and r* are averages found from Eqs. (20-43). 

We shall not extend this discussion any further. Prigoginet (Supple¬ 
mentary Reading list) goes into the subject in great detail. It will be ob¬ 
vious (Problem 20-7) from the LJD equations (20-46) through (20-49) 
how to compute the equation of state, A A e , etc. In fact, the LJD equa¬ 
tions may be considered a special case of the present discussion in which 
an approximate theoretical function is used for ¥ instead of an empirical 
one. 


t Chapters 9 and 10. 
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20-4 Conformal solution theory. Equation (20-53) is an approximate 
equation applicable to a binary mixture of liquids, each of which obeys 
the law of corresponding states. In writing Eq. (20-53) we have assumed 
a random distribution of the two components in: (a) the configurational 
factor M\/Na.\Nb } -\ (b) the occupation of any given element of volume 
[this is implied in the use of a single 4 r —see the discussion preceding Eq. 
(20-41)]; and (c) the occupation of an element of volume in the neighbor¬ 
hood of a fixed molecule of either component [this is implied in the use of 
average parameters r* and e from Eq. (20-43), based on a random dis¬ 
tribution]. Now there would redUy be random mixing and these assump¬ 
tions would be exact if both components had the same interaction param¬ 
eters (e.g., an isotopic mixture), say « 0 o and rS 0 . Similarly, in the lattice 
model, the random mixing assumed in the Bragg-Williams approximation 
is exact if w = 0. Actually, the assumption of random mixing in the lat¬ 
tice model leads to the exact first-order or linear terms if we use expansions 
in powers of w/kT as in Eqs. (20-30) through (20-32). This is not a special 
situation, but is in fact a property of perturbation methods generally 
(e.g., in quantum mechanics): exact first-order terms are obtained by 
averaging with unperturbed (in our case, random mixing) weights. Whereas 
the expansions were in powers of w/kT in the lattice model, here we 
should use expansions in powers of e A A — <oo> e AB — «oo, etc., or the 
equivalent, since the unperturbed state (with random mixing) corresponds 
to these quantities having the value zero. We therefore reach the follow¬ 
ing conclusion: use of the random-distribution assumptions listed above 
and Eq, (20-53) will lead to an exact first-order (in Saa — too, etc., or 
equivalent) solution theory for molecules obeying the law of corresponding 
states. This means that the components must have very nearly the same 
interaction parameters e and r*. An equivalent statement is that it must 
be possible to choose parameters e 0 o and r*o of a reference component 
(hypothetical, or, more often, one of the components in the solution) such 
that €aa — Coo, etc., are all small. 

This general approach is referred to as conformal solution theory and is 
due to Longuet-Higgins. f The argument we use here (for continuity and 
simplicity) is somewhat different from that of Longuet-Higgins. It is 
based on Eq. (20-53) as the starting point and is therefore restricted to 
first-order effects. Second-order conformal solution theory has been dis¬ 
cussed in detail by Brown. X 

Without complication we can easily generalize Eq. (20-53) in two ways: 
(a) we consider any number of components; and (b) instead of restrict- 


f H. C. Longuet-Higgins, Proc. Roy. Soc. 205A, 247 (1951). For a review, 
see Prigogine, Chapter 4. 

t W. B. Brown, Proc. Roy. Soc. 240A, 561 (1957). 
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ing ourselves to the Lennard-Jones potential, we use (Eq. 15-33) u(r) — 
eh(r/r*) for each pair interaction. 

For the reference component, 

«oo = *ooh( -r\, 

vw 


and for the interaction between a molecule of component i and one of 
component j, 



(20-54) 


Following Longuet-Higgins, we define fa [not to be confused with /y in 
Eq. (15-38)] and 0 y by 

€ij = fijtooi r ii — ~~ > (20-55) 

va 

and we shall use expansions in powers of /y — 1 and ga — 1 (the ref¬ 
erence component has f 00 = g 00 = 1 ). 

Let us rewrite Eq. (20-53) for a multicomponent system and at the 
same time omit the factors A?^*, calling the remaining partition function 
the configurational partition function Q e : 


<3«(N, V, T) 


[r*H{t/kT, v/r* 3 )] M Ml 

UiNil 


(20-56) 


where M = » *» V/M, r* and e are (random) average parameters, 

and ¥ is an empirical corresponding-states function applicable to each 
pure component. For example, for pure s, 



(20-57) 


For a hypothetical (isotopic) reference solution with all parameters the 
same as e 00 and rJo, 

Qe(N, V, T) = (20-58) 

At this point, let us calculate the first-order random average param¬ 
eters r* and e for use in Eq. (20-56). We have (x = mole fraction) 


r* = x&jr*j = 2 x&j ^» 
i) ij Vii 
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or 

r* - r*o = 

= r* 0 2 *<*i (zr “ 1 )' 

it w« / 


Thus to first-order terms in gy — 1, 



* 

11 

CH* 

O 

1 - Yl, Briton ~ !)] 

a 0 


and 

■ 

,*3 _ _*3 

r — r 0 o 

_ j 

1 - 3 xnjigy - 1)1. 

(20-59) 



a J 


Similarly, 

€ == €00 

^1+2 XiXjtfy ~ l)j • 

(20-60) 


We notice, from Eq. (20-58), that 

o<Vn v> T'\ - Ko 3 *(eAr, v/r")\ u M\ 

v t 1 ) TT »r I ’ 


where V' = Vr%l/r* 3 and T' — Tt oo/«- Comparison with Eq. (20-56) 
then gives 

Q C (N, V, T) = Q?(N, 7', r). (20-61) 


Next, we want to expand In Q?(N, V', T') in powers of the gy — 1 
and fy — 1. This, together with Eq. (20-59), will give us lnQ e (N, V, T) 
(that is, the logarithm of the configurational partition function of the 
actual solution of interest), expanded in powers of the gy — 1 and /,, — 1. 
To linear terms, 


In Q?(N, V', T') = lnQ<?(N, V, T) 


+ 




+ 


d In Qo(N, V 1 , T’) \ 

. 9gy / H.r.r 

/—s—i 

ainQg(N, V', T') \ 

« dfy A 


«,V,T 
f-t -1 


(ffa ~ l) 
if a- l)]’ 


(20-62) 


In Q?(N, V, T') dlnQ° dV' 3 pV _ 3p°Vx i x i 

dgy ~ dV 1 dgy ~ kT XiX} kT 


where 
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and we have used Eq. (20-59) in differentiating V'. The pressure 
p°(M, V, T) is the pressure of the reference solution (or of the pure 
reference component at the same M, V, and T, since the intermolecular 
forces are the same in either case). Similarly, from Eq. (20-60), 

d In Qj?(N, V TO = dlnQc <*T _ _ _ Elx&j 

dfij dr dfij /,ff—*i kT ’ 

where E°(M, V, T) is the configurational energy (intermolecular potential 
energy) of the reference solution (or of the pure reference component at 
M, V, and T). Then Eq. (20-62) becomes 

In Q?(N, V’, T) = In Q?(N, V, T) 

+ £ *<»f [^fr -d-§(/«- d] + • -. 

Finally, we put this result and Eq. (20-59) in Eq. (20-61) to obtain 
A.( N, V, T) - i4®(H, V, T) 

= ^ XiXj[Ec(fij - 1) + 3(MkT - p°V)( gii - 1)] + • • •, (20-63) 
a 

where A c — —kT In Q c is the configurational Helmholtz free energy. 
This exact and elegant result expresses A e for the solution in terms of the 
intermolecular interaction parameters fa and gn and of the properties 
of the pure reference component jE 7°, p°, and 

<i-com P (M, V, D = A?,, o1 (N, V, T) - MkT J In *<. (20-64) 

% 

The A° in Eq. (20-63) refers, of course, to the solution; the last term in 
Eq. (20-64) is the ideal entropy of mixing term. These properties of the 
pure reference component are determined by e 0 o, rJo, and the empirical 
law of corresponding states. 

The independent variables N, p, T and the function F e are more con¬ 
venient than N, V, T and the function A c , so we proceed now to make this 
change. Let p be the pressure on the actual solution when its volume is 
V and let V a be the volume of the actual solution when its pressure 
is p°. Now let us abbreviate Eq. (20-63) by the notation 

Ac(p, V) - A° c (p°, V) = 2 <P°, V). (20-65) 

What we want (since we keep the same reference solution at p°, V ) is the 
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corresponding equation for 

Fc(p°, V°) - F?(p°, V), 

where 

F c (p°, V°) = A c (p°, F D ) + p°V a , (20-66) 

Fc(p°, V) = A c {p«, V) + P°V. (20-67) 

If we integrate dA c /dV = —p from p°, V a to p, V, we obtain, to the 
linear term in F — F D , 

A e (p, V) - A c (p°, F°) = -p°(F - F D ). (20-68) 
From Eqs. (20-65) through (20-68) we find, then, that 
F„(N, p°, T) - F?(N, p°, T) 

= Yi XixAE°c(fii - 1) + 3(m-T - p°V)(gij - 1)] + • • •, (20-69) 

«>' 

where the right side is the same as in Eqs. (20-63) and (20-65). In Eq. 
(20-69), however, we regard F and E® as functions of M, p°, and T. 

Our next task is to derive an equation for AF e , which, incidentally, is 
the same as AF c(C xecss)> since nonconfigurational contributions are the 
same in F and Fjdeai and therefore cancel. For AT,- molecules of a single 
component i at p° and T, Eq. (20-69) reads 


Fci(Ni, p°, T) - F° ci (Ni, p°, T) 

= EUfu - 1) + 3 (NikT - p°Vi)(gu - 1) + • • •, (20-70) 

where F?,-, E?,-, and F t - all refer to AT,- molecules of the reference component 
at p° and T. If the pure components mixed to form an ideal solution, we 
would have 

F C (N, p°, T) (ideal) = ^ F ci (N { , p°, T) 

i 

+ MkT £ x { In x,'. (20-71) 

i 

Since the reference solution is ideal, we also have 


F?(N,p°,T) = Y,FUNi,p°,T) 

i 

+ MkT Xi In Xi. 


(20-72) 



394 


THEORY OP CONCENTRATED SOLUTIONS 


[CHAP. 20 


Finally, then, from Eqs. (20-69) through (20-72), 

AF, = AF ^excess) = F C (N, p°, T) - F e (N, p°, F) (ideal) 

= ^ awtftfC/w - 1) + 3(A/*7* - p°V)(gn - 1)] 

- L !*?«(/« - 1) + 3 (N<kT - p 0 y,)(ff« - 1)] + • • •. 

i 

But F?,- = X{Ec and 7,- = x,-7 (the reference solution is ideal), so that 
this last equation reduces to (put x< = x, Y.j x i) 

AF t — F? y ] XiXj(2fij fa fjj) 

i<3 

+ 3 (MkT - P°V) 2 XiXjVga - g« - go) + • • •. (20-73) 

«/ 


This is the central result of (first-order) conformal solution theory. If 
we make the presumably excellent approximation that 


r , _ (r?, + r|,) 
r '> ~ 2 


or (to first order) 


Qii = 


(g<< + Qii) 


(20-74) 


Eq. (20-73) simplifies to 

AF,(N, p°, T) = E° C (M, p°, T) £ *<*>*», 

i<i 

where 

dij = 2/ 1 , /y. 

Other excess quantities follow immediately from Eq. (20-74): 

- - (Hr),*., - - !«>*> f 20 - 75 ) 

= AF, + T AS t = (f? - F ^f) 2 (20-76) 

<2<W7 > 
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Note that for binary solutions the sum Y in the above equations reduces 
to a single term, x L x 2 d l2 , symmetrical about x x = $, as in the Bragg- 
Williams theory (Eq. 20-23). 

At ordinary pressures, E° is negative (2?° is practically equal to the 
negative of the energy of vaporization of the reference component at T), 
as are also all the other coefficients of Y in Eqs. (20-75) through (20-77): 
an increase in T expands the liquid, increases the distance between neigh¬ 
bors, and hence makes E° c less negative; an increase in p° contracts the 
liquid and makes E® more negative; both terms in the coefficient of Y in 
Eq. (20-76) are negative. The sum Y can be positive or negative, but is 
of course the same in all these equations. In a binary solution, d 12 and Y 
are positive if 12 pairs are energetically more stable than 11 and 22 pairs 
[this corresponds to a positive w in Eq. (20-6)]. In this case (binary, 
d i2 > 0), all the excess quantities are negative. In general (multicom¬ 
ponent), the excess quantities are all of the same sign, positive or negative, 
and in the ratio of the coefficients of Y for any composition. 

The above predictions (and others) of the first-order conformal theory 
of solutions were extensively compared with experiment by Longuet- 
Higgins in 1951. The theory proved to be quite successful when applied 
to data on suitable binary systems available at that time. In particular, 
all the excess functions were found to have the same sign. However, more 
recently other “suitable” solutions, such as neopentane + carbon tetra¬ 
chloride and methane + carbon monoxide, have been found experimentally 
to have positive A H e and negative AF e . We have already mentioned 
that this feature can be explained by the approximate “random"theories 
of Sections 20-2 and 20-3. 

The conclusion we reach is that although first-order conformal solution 
theory is formally exact for almost ideal solutions of “corresponding- 
states” molecules, its range of applicability to real systems is quite limited. 
To extend the range of applicability, one must turn to the exact (and rela¬ 
tively complicated) second-order conformal solution theory or to the ap¬ 
proximate theories of Sections 20-2 and 20-3. 

As a final remark, we remind the reader that the “theories” discussed in 
Section 20-3 and in the present section are semiempirieal in a sense, for 
they are based on experimental “corresponding-states” information about 
the pure components. This is justified in the absence of an exact theory of 
the liquid state. On the other hand, Section 20-2 is completely theoretical, 
but in it we must use an approximate (LJD) theory of liquids. 
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Problems 

20-1. Use the Gibbs-Duhem equation to deduce an expression for p B /p% 
from (a) Eq. (20-12) (ideal), and (b) Eq. (20-18) (Bragg-Williams). (Pages 374 
and 376.) 

20-2. Derive the Bragg-Williams “mixing” equations (20-19) through (20-21). 
(Page 377.) 

20-3. Expand the quasi-chemical equations (20-28) and (20-29) in powers of 
w/JcT to obtain the second-order results (20-30) through (20-32). (Page 380.) 

20-4. Show how to redefine A in equations such as (20-36) so that it may 
refer to diatomic or polyatomic molecules instead of to monatomic molecules. 
(Page 383.) 

20-5. Break up Eq. (20-49) for A A, into separate expressions for A E, and 
AS.. (Page 386.) 

20-6. Show that the <p terms in Eq. (20-49) (UD theory) are essentially 
equivalent to Eq. (20-23) (Bragg-Williams). (Page 386.) 

20-7. Discuss the equation of state and A A. from Eq. (20-53). (Page 388.) 

20-8. Develop a Bragg-Williams lattice theory of binary solutions, allowing 
each site to be occupied either by A, by B, or be empty. Take the lattice distance 
a =» constant. 

20-9. Derive expressions for pa from Eqs. (20-52) and (20-53). Will p A fv\ 
and p B /p% have the usual symmetry about xa “ 4? 

20-10. Show from Eq. (20-74) that, for a binary conformal solution, 

pi - (function of p°, T) + kT In *i + **• 

Compare this with the Bragg-Williams equation preceding (20-18). 

20-11. Discuss the equation of state and critical behavior (gas-liquid and 
solution-solution) of a conformal solution. 

20-12. In the application of Eq. (20-74) to a binary solution, A F. should 
be the same irrespective of the arbitrary choice of component 1 or component 2 
aa reference component. Investigate this point. 

20-13. What can be deduced about dis if = («i 1 * 22 ) 1/2 ? 

20-14. In a one-component “corresponding-states” system, Q is given by the 
equation preceding Eq. (20-52). If we regard A as a function of T, V, N, t, 
and r* 3 , find the coefficients a i and <12 in the equation 

dA = -SdT-pdV+pdN + aidt+a 2 dr* 3 . 

Derive an analogous equation for a conformal solution regarding A as a function 
of T, V, N, f, g. Use this result to provide an alternative (and more elegant) 
derivation of Eq. (20-69) for F» from Eq. (20-63) for A.. 
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CHAPTER 21 

POLYMER AND POLYELECTROLYTE SOLUTIONS 
AND GELS 

In Chapter 13 we considered the configuration of polymer molecules and 
rubber elasticity. Our object in the present chapter is to extend this dis¬ 
cussion to polymer and polyelectrolyte solutions and gels. Since this is a 
relatively specialized topic in statistical mechanics, we shall confine our¬ 
selves in each section to the simplest possible analysis that brings out the 
essential features. 

To present a unified discussion, we devote the first section to the Wall 
theory of rubber elasticity. This leads to the same length-force equation 
(13-57) as the (simplified) James-Guth argument of Section 13-3, but the 
method of Wall has the advantage that it has been exploited in the theory 
of polymer and polyelectrolyte gels (Sections 21-3 and 21-4). As far as 
the theory of rubber elasticity itself is concerned, the James-Guth theory 
(which we have not fully presented) is more detailed and fundamental. 

21-1 Wall theory of rubber elasticity.* The reader should review 
Section 13-3 through Eq. (13-54) for general background and point of 
view. 

We start with an isotropic cube of rubber with edge L 0 , under no forces. 
On a molecular level, we assume that the sample is made up of a cross- 
linked network of N chains, all of the same length. But our results will turn 
out to be independent of chain length, so the assumption of uniformity of 
chain length is not really necessary. We assume further that the distribu¬ 
tion of end-to-end lengths r of the N chains in the undeformed (Ljj) net¬ 
work is gaussian and is in fact the same distribution that N free molecules 
would have (Chapter 13). Equation (13-40) is therefore applicable, and 
we rewrite it in terms of the components of r as 

P 0 (x, y,z) dxdydz = e - '** 1 **+*‘+‘ 2 > dxdydz, (21-1) 

2r a 

* F. T. Wall, J. Chem. Phye. 10, 132, 485 (1942); 11, 527 (1943). See also 
Flory, Chapter 11. 
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The subscript zero refers to the undeformed state. Our final assumption is 
that if the cube is deformed from edges of length L 0 to edges of length 
L x , L y , L„ then the end-to-end distribution becomes 


P(x, y,z) dxdy dz = e -/>W/.;>-iV/«Z>+<**/«bi dx dy (21 _ 2 ) 

where a x = L x /L 0t etc. This is obviously true if each of the N chains is 
deformed in the same way (a x , <*„, a z ) as the bulk sample itself (Prob¬ 
lem 21-1), but this detailed an assumption is not needed [only the over-all 
distribution (21-2)]. 

In the undeformed state the chains are free to assume the unbiased 
“random walk” distribution (21-1). We may therefore anticipate that this 
distribution corresponds to the maximum possible entropy of the system, 
S 0 - Any deformation of the sample will lead to a decrease in entropy 
because the chains are forced to assume a distribution [e.g., (21-2)] which 
is not “random” or “unbiased.” Our next step is to compute the entropy 
difference S(a x , a v , a z ) — So- 

Let us divide the “end-to-end space” x, y, z [one end of the chain is 
chosen as origin, the other end is at x, y, z (Fig. 13-1)] into small elements 
of volume dr,- = dx,- dy { dz z . Then p, = Po(ti) dr,- is the probability of a 
free chain having an end-to-end vector in dr,. The undeformed state of 
the network corresponds to the chains having their most probable end-to- 
end distribution, with n,- = Np z chains in du lor each i. The deformed 
state a x , oty, a z , on the other hand, corresponds to a distribution which is 
not the most probable: there are, according to our assumption (21-2), 
Si = NP(,n) dr,- chains in dr,-, with s< A n< unless a x = a y — a, = 1. 
We want to calculate the ratio of the probabilities of observing these two 
distributions, Q(a*, a v , a x )/U 0 , because S — S 0 = k In (0/Q 0 )- 

The probability that any specific chain is in dr,- is p,-, and the probability 
that n,- = Npi specific chains are in dr,- is p"*. Therefore, for the unde¬ 
formed state (most probable distribution), 


Qo = 




(21-3) 


where the combinatorial factor is inserted because it is immaterial which 
particular chains are in dr,-, etc. Now if we place in each of the dr,-, not the 
most probable number of chains n,-, but some other number «,-, then the 
corresponding U is 


where we expect Q < Jl 0 (Problem 21-2). Equation (21-4) is a quite 
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general expression for any set of numbers «j, $ 2 ,..., but we are interested 
in the particular set «,• = NPdu from Eq. (21-2). On using 

2>. = i, !> = !> = *> 


we find from Eqs. (21-3) and (21-4), 




fio 


Si 


But 


i Hi l ^ o(^i) 1 

In — = In = In a x <v** 


so that 


Now 


In Jjj = JVTn azctyct, - 0 2 ^1-l) S * iX * + etc ]• 

^2 3 < x i = N X) P(ti)Xi dti = 2V JJJx 2 P(x, y, z)dxdydz 


_ Wa| 

2/3* ’ 

and therefore 

AS = S(a», a„, a,) — So = k In ^ 

= TVA: [In a x ctyC( t — J(a x + a 2 + <** — 3)]. (21-5) 

It is easy to verify (Problem 21-3) that AS has its maximum value at 
«* = «*=«*= 1, as we have anticipated. We note that AS is inde¬ 
pendent of chain length; it depends only on the number of chains in the 
network and on the macroscopic deformation parameters a*, a„, a*. 
Therefore, if the network has a distribution in chain lengths, we obtain a 
contribution to AS of the form (21-5) for each group of chains of the same 
length and a total AS still given by (21-5), where N is the total number of 
chains of all lengths. 

We next introduce a correction into Eq. (21-5), suggested by Flory.* 
Actually, this correction does not affect the length-force equation for 


* See Flory, p. 468. 
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rubber, derived below, but will influence our results beginning with 
Section 21-3. In deriving Eq. (21-5) we have assumed that the entropy 
of a network is the same as the entropy of the same number of polymer 
chains not involved in a network, but with the same end-to-end distri¬ 
bution. This neglects the fact that when a network is formed from 
chains, there is an entropy change which is volume-dependent, and 
therefore this network-formation entropy will in general be different at 
a*, a„, a* than at a x =<*„=<*« = 1. A network of N chains has N/2 
cross-links. Each cross-link is formed by chemical bond formation between 
a monomer on one chain and a monomer on another. If the second mon¬ 
omer must be within a small volume 5F (a constant) around the first 
monomer in order for reaction to occur, the probability for cross-link 
formation is proportional to SV/V. Hence the probability of forming 
N/2 cross-links is proportional to (SV/V) NI2 , and the ratio of this prob¬ 
ability at V to the probability at Ljj is (a^aya/) - ** 12 . This contributes an 
additional term to AS, —(Nk/2) In (axctya t ). The corrected form of 
Eq. (21-5) is thus 

AS = S(a x , ct y , or*) — So = (In axCtyCt, — a 2 — a 2 — a* + 3). 

( 21 - 6 ) 

We now consider the rubber elasticity problem specifically. Here, for 
stretching in the x-direction, 

V = Lq — constant, a x a y a t = 1, 

L x = L, a* = a, 

Ly = L„ <Xy — a* = • 

Therefore, Eq. (21-6) becomes 

S(«) -S(l) = ^(3-« 2 -f)- 
Then from Eq. (13-52), 



This is just the same equation as (13-57), obtained from the James-Guth 
theory, with C = L^ 2 (see also Fig. 13-4). 

21-2 Floiy-Huggins polymer solution theory.* We leave, temporarily, 
the discussion of polymer networks begun in the previous section and 
turn here to solutions of free polymer molecules (component 2) in a solvent 


* See Flory, Chapters 12 and 13. 
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(component 1). In the next section we will then combine the results of the 
present and preceding sections to study the swelling of polymer networks 
(gels) in a solvent. 

The theory to be presented here, due independently to Flory and 
Huggins, is a direct generalization of the Bragg-Williams approximation 
in the lattice model of binary solutions (Section 20-1). The thermodynamic 
equations of Section 20-1 are applicable without change. The Bragg- 
Williams theory is appropriate for solutions of molecules of approximately 
equal size—each site in the lattice can be occupied by either an A molecule 
or a B molecule. The essential difference in the present problem is that the 
polymer molecules are much larger (a factor of usually 10 3 or 10 4 ) than 
the solvent molecules. This leads to striking departures from the A-B 
symmetry we became accustomed to in Chapter 20. We.still use a lattice 
model and assume random mixing, but whereas a solvent molecule occupies 
only one site in the lattice, the polymer molecule occupies M sites along 
a “random walk” (Fig. 21-1), where M = 0(1O 3 or 10 4 ). We assume that 
all the chains have the same length. Since M should be regarded as the 
ratio of the two molar volumes, it will be of the order of the number of 
monomers or of the number of statistical units in a polymer chain, but 
not usually exactly equal to either of these numbers. 



Fio. 21-1. Lattice model (schematic, in two dimensions here) for polymer 
molecule in a solution. Sites not occupied by polymer segments are occupied by 
solvent molecules (one per site). 
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Because of the great difference in size of the two molecules, the mole 
fractions x\ and x 2 are no longer very useful. Instead, we use volume or 
site fractions: 

ATi MN 2 

V1 ~ Wi + MNl* * 2 = JTt + MN 2 ’ ^‘ + ^=1. 


We shall sometimes denote the total number of sites by Mo = N t + MN 2 . 

We investigate first the entropy of (random) mixing solvent and polymer 
molecules. Let £2(2Vj, N 2 ) be the number of possible configurations or 
arrangements of N t , N 2 molecules on M 0 sites, and let £2(0, N 2 ) be the 
number of possible configurations of N 2 polymer molecules on MN 2 sites 
(this refers to the pure polymer before mixing). Then, since £2 = 1 for 
Ni solvent molecules on N x sites (pure solvent), the desired entropy of 
mixing is 


AS m = k In 


Wi,_N£ 

£2(0, N 2 ) ' 


( 21 - 8 ) 


We now find £2 (N lt N 2 ). 

The number £2(iV,, N 2 ) is just equal to the number of ways of arranging 
N 2 polymer molecules on Mo sites, for after we place the polymer molecules 
in the originally empty lattice, there is only one way to place the solvent 
molecules (i.e., we simply fill up all the remaining unoccupied sites). 
Imagine that we label the polymer molecules from 1 to N 2 and introduce 
them one at a time, in order, into the lattice. Let be the number of ways 
of putting the t-th polymer molecule into the lattice with i — 1 molecules 
already there (assumed to be arranged in an average, random distribu¬ 
tion). Then the approximation to £2(iV 1 , N 2 ) which we use is 


«(tf 1, n 2 ) =^n«. 


(21-9) 


The factor (W 2 !) -1 is inserted because we have treated the molecules as 
distinguishable in the product, whereas they are actually indistinguishable. 

Next, we derive an expression for «< + i. With i polymer molecules 
already in the lattice, the fraction of sites filled is/,- = Mi/M 0 . The first 
unit of the i + 1-th molecule can be placed in any one of the M 0 — Mi 
vacant sites. The first unit has c nearest-neighbor sites, of which c(l — /,-) 
are empty (random distribution assumed). Therefore the number of 
possible locations for the second unit is c(l — /,-). Similarly, the third 
unit can go in (c — 1)(1 — /<) different places. At this point we make the 
approximation that units 4, 5, ..., M also each have (c — 1)(1 — /,-) 
possibilities, though this is not quite correct (Huggins’ treatment is more 
detailed here, but in view of the rather crude model, we omit this refine- 
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ment). Multiplying all of these factors together, we have for at+u 
»«+i = (M 0 - Mt)c(c - 1)"" 2 (1 - 

where we have replaced c by c — 1 as a further approximation. 

Now we will need 

JYj / _ A Ni-l 

In JJ «,• = N 2 (M — 1) In \—tf ~) + M £ In (M 0 - Mt). (21-10) 

«—i \ Mo / iwmQ 

We approximate the sum by an integral: 

j 0 (Mo “ Mi) di = i J n In udu 

= (M 0 In M 0 - M 0 - Nx In AT, + AT,). (21-11) 

We put Eqs. (21-10) and (21-11) in (21-9) and find 
In Q(AT,, N 2 ) = -N 2 In 1V 2 + AT 2 — AT, In IV, + ATi 

+ M 0 In Mo - Mo + AT 2 (M - 1) In • 

From this result, we also have 

In Q(0, N 2 ) = -JV* In N 2 + N 2 + MAT 2 In MAT* - MAT 2 

Therefore, from Eq. (21-8), 

^ = —JV, In v>x - JV 2 In (21-12) 

This very simple relation is the generalization of Eq. (20-20) to solutions of 
molecules of unequal size. Note that if we put M = l(<n = x lt <p a = x 2 ), 
we recover Eq. (20-20). Figure 21-2 shows AS m /M 0 k plotted against <p 2 
with M > 500 (Problem 21-4). This should be compared with Fig. 20-2 
for molecules of equal size. 

Next, we calculate A E m on the same random mixing (Bragg-Williams) 
basis. We let «n be the interaction energy between nearest-neighbor 
solvent molecules, w 22 between nearest-neighbor polymer units (not 
chemically bonded), and w\ 2 between one solvent molecule and one poly- 
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Fig. 21-2. Entropy of mixing as a function of volume fraction of polymer for 
M > 500 (the curve is independent of M for sufficiently large M). 

mer unit. In the solution N\, N 2 , the probability that any site is oc¬ 
cupied by a solvent molecule is <p\ and by a polymer unit is <p 2 . The 
average number of solvent molecules nearest neighbor to a polymer unit 
is (c — 2 )<pi, neglecting end-of-chain effects. Therefore the number 
77 12 of 12 interactions is 77 u = (e — 2)<piMN 2 . Similarly, 77 u = 
c<piNi/2 and 77 22 — (c — 2)<p 2 MN 2 /2. In the pure polymer, = 1 
and 77 22 = (c — 2)MN 2 /2, while in the pure solvent, 77 n — cN x /2. 
As a further simplification and approximation, we replace c — 2 by c in 
the above expressions, so that 

A E m = (c Vl MN 2 w 12 + ^ctpiNiWn + %ctp 2 MN 2 w 22 ) 

— (%cMN 2 w 22 -)- icNiWii) 

= _ cMoww , (21 _ 13) 

where, as usual, 

w — tt|i + «>22 — 2 wi 2 . 


Equation (21-13) is the generalization of Eq. (20-21); the two equations 
are the same when M — 1. We define the “mixing parameter” X by 
X = —cw/2kT. Then 

= xafow*. (21-i4) 

This is often called the “van Laar heat of mixing” expression. When w 
is negative and X positive, 11 and 22 neighbors are more stable than 12 
neighbors. In this case (x > 0) the solvent is said to be a “poor solvent” 
(solvent and polymer molecules “dislike” each other). A “good” solvent 



406 


POLYMER AND POLYELECTROLYTE SOLUTIONS 


[CHAP. 21 


has X < 0. We shall find below, just as in Section 20-1, that if X is positive 
and large enough (e.g., if w is negative and T is low enough), the solution 
will split into two phases with different compositions. 

We have mentioned in Section 20-1 that w may be regarded as a func¬ 
tion of temperature. This proves to be a particularly useful generalization 
in polymer solution work, though we shall avoid it by discussing isothermal 
processes only. If w is a function of T, AE m is a free energy rather than 
an energy. Even if w is a constant, X is a function of T, of course. 

Our derivations of AS m and AE m have assumed a random distribution 
of polymer units over the whole lattice. We should emphasize that although 
this assumption is in general a reasonable first approximation, it is not 
realistic when <p 2 is so low that only isolated molecules or pair, triplet, etc., 
interactions between otherwise isolated polymer molecules are involved. 
For in this case the concentration of polymer units will have a finite 
value within the space more or less occupied by polymer molecules but 
will be zero in the space between polymer molecules. Thus, for example, 
we cannot expect to get a reasonable osmotic pressure second virial 
coefficient from the present theory (see Section 21-6). 

Finally, the free energy of mixing follows from Eqs. (21-12) and (21-14): 

If = ¥ ! 'T = JVlln,,1+ ^ ln ^ + xarow*2. (21-15) 

The form of this result, which should be compared with Eq. (20-19), 
suggests that it ought to be possible to derive it without using a lattice 
model, and indeed this has been done.* 

With the free energy of mixing available, we can now easily find the 
chemical potentials pi and p 2 from the thermodynamic equation (20-1). 
Thus 

( dAA m /kT \ Pi(yz) - Pi(0) Pi 
V dN 1 ) Nt ,T kT m p0 

= In (1 — <p 2 ) + ^1 — <P 2 + *<P 2 , (21-16) 

where pi (0) is the chemical potential of the pure solvent and pj is the vapor 
pressure over the pure solvent. Equation (21-16) reduces to Eq. (20-18) 
when M = 1, but otherwise there is a term in <p 2 that destroys the sym¬ 
metry of Figs. 20-1, 20-3, and 20-4. When w = 0 in Eq. (20-18) (mole¬ 
cules of approximately equal size), Raoult’s law is obeyed. But if we put 


* J. H. Hildebrand, J. Chem. Phys. 15, 225 (1947); H. C. Longuet-Hiqgins, 
Faraday Soc. Disc. 15, 73 (1953). 
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Fig. 21-3. Effect of molecular size of solute on vapor pressure of solvent 
when X = 0. 


X = 0 in Eq. (21-16), there remain considerable deviations from Raoult’s 
law (Fig. 21-3) that can only be associated with the great difference in 
molecular size of the two components. 

For n 2 we have 

( dA A n /kT \ _ M2(»2) — Ms(l) 

V dNz /jVi.r kT 

= In <p 2 ~ (M - 1)(1 - <pz) + XM(1 - <p 2 ) 2 . (21-17) 

Actually, Eq. (21-17) contains no new information. It follows from 
(21-16) and the Gibbs-Duhem equation or from Eqs. (20-2) and (21-16) 
(Problem 21-5). 

Equation (21-16) is also essentially an equation for the osmotic pressure 
(2 = nondiffusible solute, 1 = diffusible solvent). For, with an incom¬ 
pressible solvent as in the present model, we have from thermodynamics 


M i(»a) — Pi(0) _n^i 

kT ~ kT’ 


(21-18) 


where »i is the volume per molecule (i.e., volume per site) of pure solvent, 
a constant. 
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Fio. 21-4. Critical behavior and phase separation for polymer solution with 
M = 1000. 

Figure 21-4 shows a plot of — [pi — ni(0)]/k,T or n»i/kT against <p 2 
for M = 1000 and different positive values of X (i.e., different temperatures 
for constant w). Critical behavior and phase separation are evident, as in 
Section 20-1, but here the curves are very unsymmetrical. The horizontal 
line can be located by use of the criteria that 

Mifas) = A*i(^a)> MsW) = M2(«4')> 

where <p 2 and <p 2 (<p 2 < <p 2 ) give the composition of the two phases in 
equilibrium. The first of these criteria is of course satisfied by any hori¬ 
zontal line; but the horizontal line has to be adjusted to satisfy the second 
one, using Eq. (21-17) for p 2 . Alternatively, if the plot in Fig. 21-4 is 
made against 1 /<p 2 instead of <?% (n vs. 1/V 2 is analogous to p vs. v for 
a gas), the equal-area theorem can be used to locate the horizontal line 
(Problem 21-6). 

To locate the critical point we put 

_ sVi _ Q 
S<p 2 Sv 2 ’ 

using Eq. (21-16), and find 

- + 0 - 3 ?) + 2xn - “■ 

1 

(1 - V *) 2 


+ 2X = 0. 
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Fig. 21-5. Temperature-composition curve for polymer solution with 
M «= 1000. 


Eliminating 2x, we get the critical value 

1 


V 2c = 


Then 


1 + 3f»/2 
(1 + 3/ 1/2 ) 2 


2 M 


As 3/ —> oo, these relations become 


<P 2c 


3 / 1/2 ' 


c 2 ^ 3/1/2 ^ 2 


(21-19) 

(21-20) 


For 3/ — 1000, <P 2 c — 0.0307 and X c = 0.532. When 3/ = 1, we get 
the same results as in Section 20-1 (Bragg-Williams). 

If X is positive and less than X c (=0.5) or if X is negative, the polymer and 
solvent are miscible in all proportions. But if X > x e , phase separation 
occurs for some compositions. Since necessarily <p' 2 < <p 2 o one phase is 
extremely dilute in polymer. If we vary X by varying T, we obtain a 
temperature-composition curve, as in Fig. 21-5 for M = 1000. Note 
especially the marked asymmetry. In the shaded area of this figure, 
two solution phases are present (immiscible region); the miscible (one 
solution) region is outside the shaded area. 

As an illustration, we give in Table 21-1 a few experimental values 
of X for the polymer molecules of natural rubber. 

A great deal of work has been done on checking the predictions of the 
Flory-Huggins theory against experimental results. The reader should 
consult the Supplementary Reading list for summarizing discussions. We 
shall merely state here that the theory proves to be completely satisfactory 
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Table 21-1 

Mixing Parameter x for Natural Rubber 


Solvent 

t, °C 

X 

CC1 4 

15-20 

0.28 

benzene 

25 

0.44 

CS 2 

27 

0.49 

Ethyl acetate 

25 

0.78 

Methyl ethyl ketone 

25 

0.94 

Acetone 

25 

1.37 


in a qualitative or, indeed, semiquantitative way, but a number of detailed 
discrepancies have been uncovered which require refinement in the theory. 
There is no doubt, though, that this relatively simple theory contains the 
essential features which distinguish high polymer solutions from ordinary 
solutions of small molecules. 

21-3 Swelling of polymer gels.* If a sample of free (uncross-linked) 
polymer molecules is put in contact with pure solvent at the same pressure, 
the polymer will take up solvent to form a solution. In fact, if a large 
amount of solvent is available, it will continue to enter the solution in¬ 
definitely (v >2 —> 0) because the chemical potential of the solvent in the 
solution is always lower than in the pure solvent at the same pressure. 
However, we can stop the process after a finite amount of solvent has 
mixed with polymer by making the pressure on the solution higher than 
on the pure solvent (Ap = osmotic pressure = II). If, on the other hand, 
a network of polymer molecules is put in contact with free solvent at the 
same pressure, solvent will be absorbed by the network forming a gel, 
but the process will stop after a finite amount of solvent absorption 
(without establishing a pressure difference) because the tendency of solvent 
molecules to mix with the polymer chains is resisted as a result of the 
fact that mixing entails stretching of the network in this case. 

In this section we employ the concepts of Sections 21-1 and 21-2 to 
construct an approximate theory of polymer network swelling. Actually, 
it is easy to give a treatment that includes stretching as well as swelling: 
we shall therefore investigate, below, the equilibrium swelling of a net¬ 
work, under a pulling force r in the ^-direction, when in contact with solvent 
at the same pressure (Fig. 21-6). 

In view of the semiquantitative success of the Wall and Flory-Huggins 
theories (the latter is more accurate than the former), we can anticipate 


* See Flory, Chapter 13, for further details. 
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the same order of agreement with experiment here, for no new ingredients 
need to be added to the theory. 

Consider the following mixing process. The initial state, with Helm¬ 
holtz free energy A 0 , is: (1) an undeformed polymer network (free of 
solvent) made up of N 2 chains, each with M units (of solvent size), with 
volume Vq = L% — MN 2 vi; and (2) N i molecules of pure solvent with 
volume NiVi. The final state, with Helmholtz free energy A, is the above 
network, swollen with Ni molecules of solvent, with volume (assumed 
additive) V = (N 1 + MN 2 )vi and shape L x — L = aL 0 , L y — L z — 
(F/L) 1/2 . We also have the relations V/V 0 = l/<p 2 (called the swelling 
ratio) and a y — a, = l/(a<p 2 ) 112 . 

The basic thermodynamic equation for the gel (swollen network) is 

dA — — S dT -|- t dL Hi dN i -|- M2 dN 2 . 

In practice, of course, one can vary T, L, and N i but not N 2 . 

The free energy change, A A = A — A 0 , in the above process is 

A A = A A m + A A s = A A m - T AS* (21-21) 

where AA m is the free energy of mixing network with solvent, and AS* is 
the entropy of deforming the network. The implicit approximation is 
made here that the free energies of deformation and mixing are independent 
of each other. We obtain A A m from Eq. (21-15), but we have to make an 
appropriate alteration to take care of the fact that here, instead of N 2 
free polymer molecules, we have a single giant polymer molecule (the 
entire network). Thus the coefficient of In <p 2 in Eq. (21-15) is essentially 
zero. The volume fractions retain the same physical meaning, however, 
so we have 


A J 

-p=r = JVi In vi + xM 0 <pi<p 2 , 


( 21 - 22 ) 
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where, in the relation <p 2 — MN 2 /M 0 , N 2 now refers to the number of 
chains in the network. Equation (21-6) gives ASj- Therefore, 

^ = Ni In <pi + xMo<Pif 2 + ^ ^ln ¥>2 + a 2 + — 3^ • (21-23) 

We are particularly interested in the chemical potential of the solvent, 
for it determines the swelling equilibrium. When equilibrium has been 
reached (Fig. 21-6), Pi in the gel must equal mi in the pure solvent in 
contact with the gel: /ti(<p 2 ) = pi(0). Therefore, we have for this equi¬ 
librium condition, from Eq. (21-23), 

Mi(»s) ~ Mi(0) ( dAA/kT ) = ( dAA/kT \ 

kT \ dN\ V /r,«,ATj 

= In (1 — <e 2 ) + <p 2 + x<p$ + — y) = 0. 

(21-24) 

Equation (21-24) determines the equilibrium swelling ratio 1/^2 for a 
given extension a. One can give the following physical interpretation of this 
equation. The first three terms are equal to —n nc t«i /kT [see Eq. (21-18)], 
where n nc t is the osmotic pressure of the network. These terms lead to an 
expansion of the gel. (By themselves, they would equal zero and hence 
lead to equilibrium only when <p 2 —* 0.) The last two terms are equal to 
—PdVi/kT, where pa is the pressure associated with deformation of the 
network. This follows from 

dAA d 1 dA A d 
Vd ~ dV ~ v ! dN 1 ' 

Actually, pd is negative and —pd is a force per unit area tending to con¬ 
tract the gel. When equilibrium is reached, these two opposing pressures 
just balance each other: n ne t = ~Pd- 
The special case of Eq. (21-24) which is of most interest is free (and 
isotropic) swelling of the gel in the solvent; that is, r = 0 and F/F 0 = 
a 3 = \/<p 2 . Therefore we replace 1/a in Eq. (21-24) by <p\ 13 and solve 
the resulting equation for <p 2 . In a good (or not too poor) solvent and with 
ordinary values of M, the swelling ratio l/<p 2 will exceed 10, and <p 2 < 0.1. 
(Of course, if the swelling is too great, say 1 /<p 2 > 30, the gaussian dis¬ 
tribution used in deriving the deformation terms begins to become inac¬ 
curate.) Therefore we expand the logarithm in Eq. (21-24) to obtain 
the approximate equation 


(21-25) 
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log 1/(1 - x) 


Fig. 21-7. Theoretical behavior of a stretched, swollen gel. 


If we also neglect £ compared to <p 2 2I3 > we have 




(21-26) 


As a numerical example of this last equation, a swelling of a = 3 and 
I/V 2 = a 3 = 27 is obtained if we take X = 0.25 and M = 972, or if we 
take X = 0 and M — 486. Figure 21-7 (t — 0 curve) shows a plot of a 
against log M (£ — X), calculated from Eq. (21-25). 

Equation (21-26) gives reasonable agreement with experiment for 
solvents leading to large swelling. The exponent 5/3 has been verified 
experimentally, and the values of X calculated from swelling experiments 
and Eq. (21-26) agree fairly well with those found from vapor pressure 
measurements on the uncross-linked material using the Flory-Huggins 
theory. 

Returning now to the more general situation in which the gel may be 
stretched, we calculate the length-force relation from 


T = fid) kT( dAA/kT \ 

\9L/T,Ki,Nt Lq \ da /r,A'„JV a 

with the result 

tL 0 __ 1 _ 

N 2 hT ~ 1 ~ a aV 2 ’ 


(21-27) 


For a given a, we find <p 2 from Eq. (21-24) and then t from Eq. (21-27). 
When l/v >2 is large, we expand the logarithm and find (Problem 21-7) 
the generalization of Eq. (21-25), 

a 2 (« — <)[a(a — 0 — $] = Af(£ — X). (21-28) 


If we also drop £ compared to a 2 , we have 


t — a 


- x)] 1/2 

a 3l2 ’ 


(21-29) 
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which reduces to Eq. (21-26) if t — 0. Curves for t = 1 and t = 2, cal¬ 
culated from Eq. (21-28), are included in Fig. 21-7. 

Comparison of Eqs. (21-27) and (21-29) shows that, when 1 /vj 2 is large, 

— = [M(i - X)] ll2 a 112 . (21-30) 

<P2 

This equation predicts that a highly swollen gel, in equilibrium with 
solvent, will absorb more solvent when it is stretched. This is confirmed 
experimentally. 

21-4 Swelling of polyelectrolyte gels.* Suppose that along the molecu¬ 
lar chains of the network studied in the preceding section there is a fixed 
charge ze on every xthf unit (on the average) formed by the dissociation 
of an ion (e.g., —XNa —*■ —X - + Na + ). In this case the “solvent” is 
an electrolyte solution (usually aqueous), though we still treat it for 
simplicity as a one-component solvent in nonelectrostatic expressions. 
The gel now resembles the “inside” solution in a Donnan membrane 
equilibrium (Section 19-2), and the “solvent” in which the gel is immersed 
(Fig. 21-6) is the “outside” solution. The fixed (“solute”) charges are 
nondiffusible (confined to the gel), but the electrolyte ions are diffusible. 
No membrane is necessary because the fixed charges are attached to the 
polymer chains. 

The repulsion between the fixed charges furnishes an additional tend¬ 
ency for the gel to swell (beyond what one would calculate from Section 
21-3). We give in this section the first-order theory of this effect. Later 
in the section, we indicate how the treatment can be generalized to 
take care of the case in which the charge on the polymer chains can 
vary in amount because of a dissociative or binding equilibrium (e.g., 
—COOH — COO" + H+). 

Aside from intrinsic interest, important applications of this work are 
to the electrostatic theory of muscle contraction and to ion exchange 
resins. Muscle contraction will be referred to again at the end of the 
section. 

For simplicity we assume that although the fixed charges are attached 
to polymer chains, they are spatially distributed in the volume V occupied 
by the gel just as if they were free to move throughout V. Because of the 
Brownian motion of the polymer chains, this is not an unrealistic assump¬ 
tion. However, these charges are not and do not count as additional 


* The simple theory in the present section is included in the following papers, 
where a more detailed treatment will be found: T. L. Hill, J. Chem. Phys. 20, 
1259 (1952); Faraday Soc. Discussions 13, 132 (1953). 
t This x has no relation to x or z< in Chapter 20. 
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molecules in contributing, for example, to the osmotic pressure. We 
assume further that electrostatic effects are independent of mixing and 
deformation effects—we have merely to add appropriate electrostatic 
terms. 

The simplest procedure for us is to work directly with Eq. (21-24) 
rather than with the free energy, because of information already available 
in Chapter 19. (The free-energy approach is the subject of Problem 21-8.) 
We have seen that the first three terms in this equation may be associated 
with the osmotic pressure n ne t of the chains of the network. Similarly, the 
additional tendency of the repulsion between fixed charges on the polymer 
chains to expand the gel may be attributed to the osmotic pressure Ilei of 
these “nondiffusible” or “solute” charges. We proceed next to evaluate n e i. 

We saw in Section 19-2 that the leading terms in the osmotic pressure 
for a system of charged (ze) solute molecules with number density p is 

k 

where p% in £ refers as usual to the fcth diffusible ionic species in the 
outside solution. The second term in Eq. (21-31) is a consequence of the 
electrical neutrality in the outside solution (see Problem 19-5). (The 
second term can also be interpreted as due to the excess total concentra¬ 
tion of diffusible ions, inside over outside; or as due to the repulsion be¬ 
tween solute molecules.) In the present case, the fixed charges do not 
count as additional molecules contributing to II, but they do produce a 
“neutrality” term. That is, in Eq. (21-31), since we are dealing with a 
single giant molecule (network), we let p —► 0, keeping the “solute” charge 
density pze constant [this resembles the argument leading to Eq. (21-22)]: 

n*i (charge density) 2 /nt on , 

kT -2^E-’ (21_32) 

The fixed charge density in the gel is 

MNj z» _ MNjZ* V 0 _ z&p 2 
x V ~ xVo V ~~ xvi 

The term to be added to Eq. (21-24) is —U t \V\/kT. Thus we have 

In (1 - fp a ) + <P 2 + *<fil + jj (j ~ y) - = 0 ‘ < 21_33 > 
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The length-force relation, (21-27), is unaltered because n e i depends only 
on the volume of the gel and not its shape. 

It is clear from Eq. (21-33) that the effect of adding charges to the net¬ 
work, when an electrolyte solution is “solvent,” is simply to lower the 
effective value of X to 


x — 


2x*viZ 


(21-34) 


This increases the degree of swelling, as expected. Figure 21-7 is applicable 
with the effective X in (21-34) replacing X itself. The electrostatic term in 
(21-34) is seen to be proportional to the square of the amount of charge 
placed on the gel (« z 2 /x 2 ) and inversely proportional to the ionic strength 
of the electrolyte solution in which the gel is immersed. As a numerical 
example, take |z| = 1 and let the electrolyte be of type 1-1 with 
number density p°. In an aqueous solution, p°vi — c°/55.5, where c° is 
the molar concentration of electrolyte. Thus (21-34) becomes, in this 
special case, 


55.5 

4x 2 c° 


(21-35) 


Now X is of order unity, often less than 0.5. If we take c° = 0.1 mole- 
liter -1 , and x = 10, the electrostatic term in (21-35) is 1.39; if x — 
(1000) 1/2 = 31.6, it is 0.139. 

When r = 0 (free swelling) and the degree of swelling is large, Eqs. 
(21-26) and (21-35) give us 

n £ = 1) _ [ *-x+ (13.9 /x 2 c°) 1 3/s 

v(z = o) - L i - x J 11 

for the ratio of gel volumes with and without fixed charges on the gel. For 
example, if X = 0.25 and c° — 0.15 mole-liter -1 , 

= 9.4 when x = 3, 

= 2.5 when x = 10. 

Because of the enhanced swelling in polyelectrolyte gels, the gaussian 
distribution is often inadequate if an accurate theory is desired, and must 
be replaced by the Langevin function (Eq. 13-20). 

Next, we consider a polyelectrolyte gel with an ionic dissociation (or 
binding) equilibrium. Let N be the number of ions bound to the network 



21-4J 


SWELLING OF POLTELECTROLYTE GELS 


417 


(each such ion has a charge mze) and let /i be the chemical potential of the 
bound ions. The total number of sites for binding is B. These are dis¬ 
tributed uniformly over the chains of the network, and are all equivalent. 
The partition function of a bound ion is q. A term n dN has to be added 
to the thermodynamic expression for dA. When no ions are bound, the 
network is assumed to have a total fixed charge of MN 2 z«/x. This becomes 
nze when N ions are bound, where 


n 


N 2 M 

x 


+ Nm. 


We have to extend Eq. (21-21) for A A. The initial state, with Helm¬ 
holtz free energy A 0 , has N = 0. The final state, with free energy A, 
has N bound ions. Thus, assuming additivity of all contributions to A.A, 

AA = A — A 0 = A A„ - TAS d + AA e i + AA»ds, (21-37) 

where AA e i is the electrostatic free energy (the work necessary to charge 
up the network in the presence of electrolyte), which we have to deduce 
from 11,1, and where AA a( j a is (Eq. 7-5) 

AAad, = -kT[B\nB - JVlnN - (B - IV) In (B - N)+N\aq(T)\. 

(21-38) 


Now we consider AA e i. First we note that n can be written as 


n = 



The quantity xB/N 2 M, a constant, is the number of ion binding sites on 
the polymer chains (B) divided by the number of fixed charges on the 
chains ( N 2 M/x ). As usual, 9 = N/B, the fraction of binding sites oc¬ 
cupied. Thus the last term in Eq. (21-33) has to be modified to read 


n e itq zV|[l + 6m(xB/N 2 M)) 2 
kT ~ 2a: 2 t»iL 

Now, as in Eq. (21-24), we have 

/ a AAoi/kT \ _ _ ITeiPi 

\ dN\ /T,a,N 2 ,x kT 


(21-39) 


(21-40) 


so we can obtain AA e i from Eqs. (21-39) and (21-40) by integrating with 
respect to N u which enters only in <p 2 = N 2 M/(Ni + N 2 M). We choose 
as limits iVi and iVj = oo. When Ni = oo, V = oo and AA c i = 0 (the 
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work of charging is obviously zero). We find easily (see also Problem 21-8) 


A .del z 2 n 2 

kT ~ 2VX 


(21-41) 


It has already been mentioned that the swelling equilibrium equation, 
(21-33), is modified by use of (21-39) as the electrostatic term. The 
length-force relation is still (21-27). We must, finally, derive the binding 
equilibrium equation from 


H (dA/kT 
kT~\dN . 


\ = ( dAA el /kT \ 

/ Nl . Ni . L.T \ 9 N /Ar„.V 2 , 


( d AAaAa/kT \ 

\ dN )i 


z 2 nm , . 9 

VT, + (1 - 9)q ' 


We can rewrite this as 


X(f = _A_ exp j ^ n H l +Jrn^B/N 2 M )\^ . 


The quantity \q is equal to the outside concentration of the ions in the 
binding equilibrium, multiplied by a constant (Eq. 7-9). 

The three basic equations referred to in the above paragraph determine 
the properties of the system. For example, consider the free swelling 
(r = 0) of a polyelectrolyte gel with a binding equilibrium. The param¬ 
eters X, M, z, m, x, B/N 2 M, i'i, and ]£ all have to be assigned. From 
Eq. (21-27), a -1 = <p\ 13 , which is put in Eq. (21-33) [modified by (21-39)] 
to eliminate a -1 . Then, if we assign a value to <p 2 , 9 can be calculated from 
this equation. Finally, the pair of values <p 2 , 1 9 is put in Eq. (21-42) and 
\q is calculated (Problem 21-9). 

A theory of polyelectrolyte gels, in a somewhat more sophisticated 
form than above, has been developed and tested experimentally by A. 
Katchalsky and his collaborators. * Again the theory proves to be adequate 
for semiquantitative purposes. 

Important early work in this field was done by Hermans and Overbeek.f 
Perhaps the most elaborate unified theory of polyelectrolyte solutions and 
gels at present available is that due to Harris and Rice4 


* See, for example, A. Katchalsky, J. Polymer Set. 7 , 393 (1951); A. Katchal¬ 
sky, S. Lifson, and H. Eisenbero, ibid, 7, 571 (1951; A. Katchalsky and 
I. Michaeli, ibid. 15, 69 (1955); A. Katchalsky and M. Zwick, ibid. 16, 
221 (1955). 

5 J. J. Hermans and J. Th. G. Overbeek, Rec. trov. chim. 67, 761 (1948). 
F. E. Harris and S. A. Rice, J. Phys. Chem. 58, 725, 733 (1954); J. Chem. 
Phya. 25, 955 (1956); S. A. Rice and F. E. Harris, J. Chem. Phys. 24, 326, 
336 (1956); Z. physik. Chem. (Frankfort) 8, 207 (1956). 
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Finally, we mention briefly an interesting possibility in the application 
of the above theory to muscle action. It can be shown* that a polymer 
or polyelectrolyte gel, constrained to a constant radius and with X suf¬ 
ficiently larger than 0.5, will exhibit a first-order phase transition in its 
length-force curve. This means, for example, that the gel may undergo 
a very large change in length (“contraction”) at constant force as a con¬ 
sequence of a very small change in some parameter, such as ionic strength 
or the concentration of an ion or molecule being bound on the polymer 
chains. This type of precipitous or “razor-edge” behavior, observed in 
all first-order transitions, seems characteristic of muscle action. Other 
first-order phase transitions which have also been mentioned as possi¬ 
bilities in this connection are the two-dimensional a-i3 transition* (Sec¬ 
tion 14-2) and the crystalline-amorphous transition in polymers, f 


21-5 Isolated polymer or polyelectrolyte molecules in solution.}: We 
emphasized in Section 21-2 that the Flory-Huggins solution theory was 
not suitable for extremely dilute polymer solutions because the theory 
assumed a uniform distribution of polymer units or segments throughout 
the volume of the solution. In this section we give an approximate treat¬ 
ment designed to take care of the limit of infinite dilution. In this limit 
we need consider only a single polymer (z = 0) or polyelectrolyte molecule 
immersed in the solvent. The polymer segments are confined to the region 
occupied by the molecule (Fig. 21-8). Between molecules are large spaces 
of pure solvent. In the next section we treat pair interactions between 
polymer molecules (which determine, for example, the osmotic pressure 
second virial coefficient). 

The point of view we adopt here resembles that of Chapter 13 in that 
we consider a single polymer molecule and the solvent it encloses as a 
thermodynamic system. Specifically, we treat the single molecule as if it 
were a spherical gel with uniform polymer density. This allows us to take 
over, with little change, the equations of the preceding two sections on 
macroscopic gels. 

In a good, or not too poor, solvent, or if charged, the polymer molecule 
will expand over a considerable volume of solvent (in which the polymer 
volume fraction might be, say, 0.01 or less). The mixing and electrostatic 
free energies are responsible for this, just as in Section 21-4. However, 


*T. L. Hill, J. Chem. Phys. 20, 1259 (1952); Faraday Soc. Disc. 13,132 
(1953). 

t P. J. Floby, Science, 124, 53 (1956). 

t See Flory, Chapter 14, for the polymer problem and T. L. Hill, J. Chem. 
Phys. 20, 1173 (1952) for the polyelectrolyte problem. 
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Fio. 21-8. Isolated polymer molecules in a very dilute solution. 


the tendency to expand is resisted by the low probability of highly extended 
molecular configurations (Chapter 13). This is a deformation free energy 
effect, just as in previous sections. 

When we treat a single polymer molecule in the solvent as a uniform 
“gel,” we introduce two new approximations. First, the polymer units 
have approximately a gaussian distribution rather than a uniform dis¬ 
tribution about the center of the sphere. We shall replace this gaussian 
distribution by a uniform equivalent sphere. Second, especially in con¬ 
nection with A.4 e i, there are surface effects which may become significant 
for a relatively small sphere but which are negligible for macroscopic gels. 
The criterion, as far as electrostatic effects are concerned, is whether or 
not the radius of the sphere is large compared to the Debye-Huckel 1 /*. 
If the electrostatic forces are large enough, these surface effects will, 
in fact, lead to nonspherical (e.g., ellipsoidal) shapes for the molecule.* 
In an extreme case, the molecule will become an extended rod in 
order to reduce AA e i. We shall not pursue these shape changes here, 
however. 

Floryf has shown that, in the present problem, the actual approximately 
gaussian distribution of polymer units about the center of the polymer 
molecule in a “free” (or random walk) polymer chain can be replaced by 
an effective uniform distribution of polymer units over a sphere of diameter 
equal to the root mean-square end-to-end distance (r*) 1,a = M ni2 a' 
(Eq. 13-45), where a' is the length of a statistical unit and M' is the number 
of statistical units in the chain. This is an intuitively reasonable result. 
When the chain is expanded (or contracted) in each dimension by a factor 


* T. L. Hill, J. Chem. Phye. 20,1173 (1952). 
t P. J. Flory, J. Chem. Phys. 17, 303 (1949). 
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a relative to the random walk state, the radius of the effective sphere 
becomes 

_ «(F) m _ 

' 2 2 

and the volume 



ira 3 M ,3l2 a' 3 

6 


= Ca 


3 


(21-43) 


which defines C (the volume of the sphere when a = 1). 

We use the basic equation 

AA = A - A 0 = A A m — T ASd + AA e] , (21-14) 


where A is the free energy of the single polymer or polyelectrolyte chain 
of M units and of the solvent in the volume V{a) = Vi(Ni + M). Here 
A 0 is the free energy of the initial (before mixing) pure solvent and of the 
(discharged) polymer chain in the random walk state, a = 1 (i.e., in a 
sample of undeformed and uncross-linked polymer chains). We retain 
the notation F 0 = Mv t and <P 2 = F 0 /F == M/(N i + M), where now 
<p 2 refers to the interior of a single poljrmer molecule and not to the whole 
solution (for which <p 2 —* 0). The free energy of mixing is given by Eq. 
(21-22), which we rewrite as 


A A m F-Fp F-Fp xF 0 (F - Fp) 
kT ~ vi m V ^ v t V 


(21-45) 


We get ASd from Eq. (21-5) [the correction in Eq. (21-6) is omitted since 
there are no cross-links here], with N = 1: 


^ = = -![(|r-.j. < 2 «e> 


Finally, if we are concerned with a polyelectrolyte molecule, we have 
from Eq. (21-41), 


AAei = z 2 M 2 
kT 2*2 FL' 


(21-47) 


Substitution of Eqs. (21-45) through (21-47) in Eq. (21-44) gives AA. 
We find the equilibrium degree of swelling by setting Vi d A A/dV = 0, 
which is obviously equivalent to Eq. (21-24). We obtain in this way 

In (1 - <p 2 ) + Vi + (x - ** + ~ ja) = °» ( 21_48 ) 


just as in Eq. (21-33), except for the deformation terms. 
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The volume fraction of polymer, <p 2 , will be smaller here than in the 
corresponding swollen network; for in a network, when a = 1 (random 
walk state), <p 2 = 1, but here, when a = 1 , <p 2 is already small, say, 
0.01 (Problem 21-10). The reason for this is that a random walk con¬ 
figuration is a rather open one, and in the present problem the open space 
is filled with solvent rather than other chains of a network. The relation 
between <p 2 and a is <p 2 = il/iq/Ca 3 . 

Therefore it will almost always be legitimate to expand the logarithm 
in Eq. (21-48), with the result 


If we put Ma = M’a' (o is the length of a polymer unit with volume tq) 
and i'i = a®, the coefficient in Eq. (21-49) becomes 


l£ 

Cvi 




In a typical case, a' = 3a, so that 

+ < 21 - S0) 


If the polymer is not charged (x = 0) and X = 0 (good solvent), and 
if M' = 900, then a = 1.7, 

if M' = 10,000, then a = 2.1. 


If x == i, a = 1. For this value of X, the tendency of the molecule to 
swell because of the entropy of mixing with solvent is just balanced by 
the tendency of the molecule to compress itself because of the poor solvent. 
The result is a molecule in the random walk state (a = 1). 

With a polyelectrolyte, the coefficient of M nl2 /Z can easily be 3 [see 
Eq. (21-35)]. In this particular case, 

if AT = 900, then a = 2.1, 

if M' = 10,000, then a = 2.6. 

Equation (21-49) has been subjected to extensive experimental checking 
(see Flory, Chapter 14) using, especially, frictional properties of the 
polymer molecules. The general conclusion is, again, that the theory is 
fairly successful but has some shortcomings, as should be expected. The 
electrostatic term, in particular, seems to predict too large an effect. 
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It is easy to extend the above theory to a polyelectrolyte molecule with 
an ionic binding equilibrium (see Section 21-4), but we leave this as an 
exercise for the reader (Problem 21-11). 

21-6 Second virial coefficient in polymer and polyelectrolyte solutions, f 
We have so far combined the basic ideas of the Wall and Flory-Huggins 
theories to treat macroscopic polymer (and polyelectrolyte) gels and also 
infinitely dilute solutions in which a single polymer molecule in a solvent 
is treated essentially as a very small gel. In this section we make a straight¬ 
forward extension of the considerations of the preceding section to a 
slightly more concentrated polymer solution. Specifically, we deduce the 
second virial coefficient B\ (Eq. 19-37) from a potential of mean force w(r) 
calculated as the work necessary to bring two isolated polymer molecules 
together from r — oo to r. 

The argument we use is based on the uniform effective sphere model of 
the preceding section. This model is quite sufficient to bring out the 
essential points. Refinements will be found in the references already given. 

Consider first two isolated uncharged (z = 0) polymer molecules in a 
solvent, each with radius R (Section 21-5). If the distance r between 
centers is greater than 2 R, there is no overlapping of the spheres and no 
interaction between the molecules. Thus w(r) = Ofor r > 2 R. If r < 2 R, 
the spheres overlap. We assume, as a first approximation, that the over¬ 
lapping does not distort the molecular configuration or the distribution of 
polymer segments. If V' is the volume of overlap of the two molecules, we 
find from elementary geometrical considerations (overlap of two spheres), 

V>(r) = ^ (2 R 3 + (21-51) 

Also, if the polymer volume fraction in each of the isolated spheres is <p 2 , 
it is 2<p 2 in V'. 

The above discussion also applies to polyelectrolyte molecules, but this 
statement requires some special comment. In general, we would expect 
that two spherical polyelectrolyte molecules of radius R and fixed charge 
zeM/x would have w(r) > 0 for r > 2 R, even though electrolyte solution 
can penetrate inside the spheres. This will be true for small polyelectrolyte 
molecules.]; But implicit in the model adopted in the preceding and present 


t See Floby, Chapter 12; also B. H. Zimm, J. Chem. Phys. 14, 164 (1946) and 
T. A. Orofino and P. J. Flory, J. Chem. Phys. 26, 1067 (1957); J. Phys. Chem. 
63, 283 (1959). 

t See, for example, F. T. Wall, el al., J. Chem. Phys. 26, 114 (1957); 31, 
1640 (1959); T. L. Hill, J. .4m. Chem. Soc. 78, 1577 (1956); 80, 3241 (1958). 
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sections is the assumption that a polyelectrolyte molecule is large enough 
to be considered to have the properties of a very small macroscopic gel. 
As far as electrostatic effects go, this condition is equivalent to R y> 1/k. 
In this case, the iixed charges will be virtually neutralized by the diffusible 
ions of the electrolyte. The electrostatic potential will be constant and 
the net charge density will be zero both inside and outside the sphere (the 
potential has a different value inside and out), except for a relatively un¬ 
important boundary region at the edge of the sphere with thickness of 
order 1/k. Thus, if R » 1/k, two polyelectrolyte molecules with r > 2R 
“see” each other as essentially electrically neutral bodies, and hence 
w(r) — 0. Consequently, we can use the same kind of argument for both 
polymer and polyelectrolyte molecules. 

In a typical numerical example, 1/k might be 20 A (Eq. 18-25) and 

B _^V_ 2XMX 1 2 =ia)A 

In this connection we might recall Eq. (19-43) for the second virial 
coefficient of large spheres with a surface charge and in the presence of 
nonpenetrating electrolyte. The “neutrality” term in that equation is pro¬ 
portional to the square of the net charge (re) in the region r < a (all of 
the charge happens to be on the surface). In the present problem, the 
corresponding term in vanishes because the net charge in r < 2R is 
essentially zero. 

We return now to the overlap volume V' when r < 2 R. When we bring 
two molecules together from infinity to r, the only change occurs in the 
two volumes V’ destined to overlap. Thus we can represent the process by: 

2 [volume V' with ip 2 ] —> 

( 1 ) 

[volume V' with 2^ 2 ] + 

( 2 ) 

[volume V' with <p 2 = 0). (21-52) 

( 0 ) 

The free energy change in this process is 

AAoverlap = w(r) = AA <2) + AA (0) - 2 AA a) , (21-53) 

where AA (n) is to be calculated as in Eq. (21-44). A helpful simplification 
is that AS«» = 0, A£^ 2> = 2 AS^\ and hence there is no deformation 
contribution to AA 0V . This is a consequence of our assumption that over¬ 
lapping is “additive,” i.e., without distortion of polymer configuration. 
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Let us put Eq. (21-44) in the form 


where 


and 


^p = M n) ln(l -«4 n) ) 

, xry 2 n) (i - / 2 n) ) A^ n) 

' Vi k 

■ xVa-Vr' 

2xMZ 


^a 0) — 0, 1P2 1 * = <P2> <P2 2> = 2<p2> 

Y<n> (i - 


(21-54) 


Substitution of Eq. (21-54) into Eq. (21-53), with expansion of the 
logarithm, leads to 

lr - £ 0 - 2X +si) 7 ' (r) <r £ 28> ’ < 21 -“ ) 

where V'(r) is given by Eq. (21-51) (Problem 21-12). The same elec¬ 
trostatic term as in Eqs. (21-33) (polyelectrolyte gel) and (21-49) (single 
polyelectrolyte molecule) enters here and lowers the effective value of X. 
When x — 0, tc(r) has the sign of 1 — 2x. Thus, in a good solvent (e.g., 
X = 0), u>(r) > 0 and work has to be done to force the molecules to over¬ 
lap. This is because the overlapping process, (21-52), involves a partial 
unmixing of polymer and solvent. In a poor solvent with X > 0.5, w(r) is 
negative: the molecules “want” to overlap to increase the number of 
polymer-polymer and solvent-solvent interactions relative to polymer- 
solvent interactions. When X = £, these mixing and interaction effects 
just balance, and u>(r) = 0 for all r. That is, polymer molecules inter¬ 
penetrate without effective interaction. This of course leads to a vanishing 
second virial coefficient, — 0. 

Next, we use the w(r)/kT of Eq. (21-55) to calculate B\ from Eq. 
(19-37). Because the integration cannot be carried out analytically, we 
resort to an expansion of e~ wlkT : 

* - -*r [- w+ s fey - ■ • ]*- < 2i -“> 

The contribution from r > 2R is zero because w — 0 in this range. We 
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find from Eqs. (21-51), (21-55), and (21-56), after elementary integrations, 

="'■'(!+2^-*) 

0.324[* + (z 2 / 2 xV£) - x]V 2 0 
ViV 

This is the same as Flory’s result [Eq. (75') of his Chapter 12], derived 
from a gaussian instead of uniform segment distribution, except that 0.324 
is replaced by 0.354. Using V = Ca s and Eq. (21-49), Eq. (21-57) 
becomes 

B* 2 = Ca\a 2 - 1)[1 - 0.324(« 2 - 1) + • • •], (21-58) 

where, it will be recalled, C is the volume of the effective isolated polymer 
sphere (Eq. 21-43) when a — 1. Equation (21-58) expresses in terms 
of the swelling parameter a of an isolated polymer molecule, which in turn 
is determined by Eq. (21-49). When the quantity in large parentheses 
in Eqs. (21-49) and (21-57) is zero, the isolated molecule is in the 
“random walk state,” a = 1, and B% = 0. When B\ ^ 0, B% has the 
sign of the quantity in large parentheses in Eq. (21-57), as expected from 
our discussion of Eq. (21-55). Convergence of Eq. (21-58) is rapid only 
when a is near unity, that is, when the quantity in large parentheses in 
Eq. (21-57) is small. 

Incidentally, the leading term in Eq. (21-57) is the same as the second 
virial coefficient obtained (a) from the Flory-Huggins theory (Prob¬ 
lem 21-13), and (b) as a first approximation by Zimm, using the McMillan- 
Mayer theory. 

Orofino and Flory have tested a somewhat refined version of this theory 
against extensive experimental results on both polymers and polyelec¬ 
trolytes. The agreement between theory and experiment is very satis¬ 
factory for polymers but less so for polyelectrolytes. Again the predicted 
electrostatic effects are too large. 


+ 


(21-57) 
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Problems 

21-1. If Eq. (21-1) gives the end-to-end distribution in an undeformcd sample 
of rubber, show that Eq. (21-2) gives the distribution in a deformed sample, 
assuming each chain is deformed in the same way (a*, a„, a.) as the bulk sample. 
(Page 399.) 

21-2. Use the method of undetermined multipliers to prove that the maxi¬ 
mum Q in Eq. (21-4) for given N and p is obtained when s t — Np ( . (Page 399.) 

21-3. Show that AS in Eq. (21-5) has its maximum value when a x = a„ = 
a, = 1. (Page 400.) 

21-4. Make a rough plot of <f 2 vs. n and AS m /Mok vs. X 2 for M = 1000. 
(Page 404.) 

21-5. Use Eqs. (21-16) and (21-17) for mi and M 2 , and A = Nyti + N 2 P 2 , 
to check Eq. (21-15) for A/I*. (Page 407.) 

21-6. Show that use of the equal-area theorem on a plot of II vs. I/V 2 to 
locate the two-phase equilibrium point in a polymer solution is equivalent to 
using the equality of the two chemical potentials in the two phases. (Page 408.) 

21-7. Derive Eq. (21-28), for the relation between a and t in a polymer gel, 
from Eqs. (21-24) and (21-27) by expanding In (1 — ^ 2 ). (Page 413.) 

21-8. In a macroscopic polyelectrolyte gel the total charge density can be 
written (compare the Poisson-Boltzmann equation of Chapter 18) 

where 4 is the electrostatic potential in the gel (relative to the outside as zero). 
Linearize the exponential, solve for 4/, and deduce Aj4 e i in Eq. (21-41) by a 
charging process. (Page 418.) 

21-9. Consider the free swelling of a gel with, say, —COOH groups which 
dissociate into —COO - and H + . That is, the gel has fixed negative charges on 
the —COO - groups and can bind H+, one for every —COO - . Calculate and 
plot the swelling ratio 1/<P2 as a function of 1 — 6, the fraction of dissociated 
—COOH groups (as, for example, in a titration of the gel with base). This 
particular case implies 2 = —1, m = —1, and XB/N 2 M — 1. Assume the 
electrolyte is of type 1 — 1, with concentration c° = 0.15 mole-liter -1 . Also, 
take *®10,X« 0.25, M = 400, and m = 18 cm 3 -molc -1 . In Eq. (21-33), 
expand the logarithm and drop the term ^ 2/2 compared with 1/a = 

Also, calculate and plot the titration curve, 1—9 against \q. Repeat the calcu¬ 
lation with c° — 0.05 mole-liter -1 . (Page 418.) 

21-10. Show from Eq. (21-43) that <p 2 is of order 10 -2 when a = 1, in a 
typical case. (Page 422.) 

21-11. Make the necessary modifications in Section 21-5 when the poly¬ 
electrolyte molecule is involved in an ionic dissociation or binding equilibrium. 
(Page 423.) 

21-12. Make a rough plot of the function V'(r) between r = 0 and r = 2ft. 
(Page 425.) 
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21-13. Show that the osmotic pressure second virial coefficient obtained from 
the Flory-Huggins theory agrees with the leading term in Eq. (21-57). (Page 
426.) 

21-14. Modify the Flory-Huggins theory for a heterogeneous polymer (mix¬ 
ture of polymer molecular weights). 

21-15. Investigate whether the Flory-Huggins type of argument can be 
applied to long rigid rods (instead of random coils). 

21-16. Extend the Flory-Huggins theory to polyelectrolyte molecules. If 
the polyelectrolyte molecules are involved in an ionic binding equilibrium, what 
happens to the titration curve (adsorption isotherm) when a phase separation 
occurs? 

21-17. Discuss qualitatively some of the errors made in assuming additivity 
of the free energy terms in Eq. (21-37). 

21-18. Beginning with Section 21-4, the solvent (electrolyte solution) has 
several components, but we treat it as a single component in nonelectrostatic 
expressions. Investigate the legitimacy of this procedure as far as the entropy 
of mixing solvent with polymer is concerned. 

21-19. According to Section 21-5, how does the radius R of the effective 
sphere vary with the molecular weight of the polymer? 

21-20. Deduce one more term in the expansion of B* in Eq. (21-57). Com¬ 
pare with Flory [Eq. (75'), Chapter 12). 
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CHAPTER 22 


QUANTUM STATISTICS 

Although the argument in the first few chapters of the book is based 
on quantum mechanics, up to now we have bypassed all topics which in¬ 
volve any but the most elementary and familiar quantum-mechanical 
ideas (energy levels, degeneracy, etc.). This has been done for the benefit 
of those readers with practically no background in quantum mechanics. 
In this chapter we consider a few of the more fundamental and simple 
problems in quantum statistics. Even here the demands on the reader’s 
knowledge of quantum mechanics are very modest—in keeping with the 
introductory nature of the present work. 

Sections 22-1 to 22-4 are concerned with the quantum statistics of one- 
component systems of indistinguishable particles without interparticle 
interactions. (We use the term “particle” here and below to refer to funda¬ 
mental particles, atoms, or molecules.) In Sections 22-5 to 22-7 molecular 
interactions are allowed. This makes the problem very involved, so our 
treatment is strictly introductory. Section 22-8 is concerned with the 
special topic of dilute symmetrical diatomic gases at low temperatures 
(e.g., ortho- and parahydrogen). 

22-1 Introduction to Fermi-Dirac and Bose-Einstein statistics. The 
problem we consider here is the same as that of Section 3-2 but now we 
give a general treatment. We are concerned with a macroscopic system 
of identical and indistinguishable particles, which do not exert forces on 
each other, in a volume V (or area a, etc.) and at temperature T. The 
possible energy eigenvalues for a single particle in V are denoted by 
e 1( « 2 , 63 ,... (functions of V), and the associated energy eigenfunctions 
are <^ 2 , ta, • ■ ■ ■ For simplicity of notation, an energy level with de¬ 
generacy u is listed here co times (i.e., these are energy states, not levels). 
In Section 3-2 we limited ourselves to special conditions such that the 
number of energy states of the above type (available to any one particle) 
is very much larger than the number (2V) of particles in the system. When 
this is the case, we found (Eq. 3-10) that the canonical ensemble partition 
function of the system is simply 

Q = (22_1) 

q = 2 e ~' ilkT - (22-2) 

j 
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When we remove the above restriction, as we do here, the situation is 
more complicated. In particular, because of symmetry restrictions on the 
wave functions of the system, the particles are not independent of each 
other (even though interparticle interactions are absent): Q is no longer 
essentially given by a product of (independent) q’s, q 1 *; indeed, the single 
particle partition function q no longer arises as a significant quantity. 
But the general results we shall find will reduce to Eqs. (22-1) and (22-2) 
as a limiting case, as expected. 

There is no way of detecting experimentally an exchange, one for the 
other, of two identical particles in a system. Let ip be the wave function 
representing the state of a system of N identical and indistinguishable 
particles. Then we can easily deduce'" from the above-mentioned experi¬ 
mental fact that ip may behave in one of two ways: when the coordinates 
of two particles are exchanged in the function ip, ip may remain unchanged 
or ip may change sign. We say that ip is symmetrical in the coordinates of 
identical particles in the former case and antisymmetrical in the latter case. 
One of the fundamental postulates or principles of quantum mechanics 
is that (a) the only states available (or “accessible”) to real systems of 
indistinguishable particles are those represented by wave functions which 
are either symmetrical or antisymmetrical, and (b) particles with half¬ 
integral spin (e.g., electrons, protons, neutrons) have antisymmetrical 
wave functions, while those with integral spin (e.g., photons) have sym¬ 
metrical wave functions. Nuclei, atoms, ions, and molecules made up of 
an odd number of electrons, protons, and neutrons (e.g., D, He 3 ) are in 
the antisymmetrical class, and those with an even number (e.g., H, H 3 , 
D 2 , He 4 ) are in the symmetrical class. The above restrictions on accessi¬ 
bility are applicable whether or not there are interparticle forces. They 
are also applicable to multicomponent systems in an obvious way (Prob¬ 
lem 22-1). 

The canonical ensemble partition function is in any case (interparticle 
forces, multicomponent systems) 

Q = e~ Bi ' kT , (22-3) 

i 

where the Ej are eigenvalues of the Hamiltonian operator X for the system. 
But we now understand explicitly that we are to include in the sum only 
those energy eigenstates associated with energy eigenfunctions with the 
correct symmetry properties. This was implicit in our discussion in 
Chapter 1, but the point was not emphasized. In other words, the sum 
is over all accessible energy states; those states which are inaccessible 
because of symmetry (or other) restrictions are omitted from the sum. 

* See, for example, L. D. Landau and E. M. Lifshitz, Quantum Mechanics. 
Reading, Mass.: Addison-Wesley, 1958; p. 204. 



22-1J 


FERMI-DIRAC AND B0SE-E1NSTEIN STATISTICS 


433 


la a one-component system, we include in the sum either all the sym¬ 
metrical or all the antisymmetrical states, depending on the kind of 
particle. It is customary to say that Bose-Einstein statistics is involved 
in the former case and Fermi-Dirac in the latter. 

We now return to the special case of a one-component system of non¬ 
interacting particles. It will no doubt be helpful to the reader if we illus¬ 
trate the above considerations on symmetry restrictions by an explicit 
simple example. Consider a system with only three particles, N = 3; 
call them a, b, and e. For simplicity, suppose that the (nondegenerate) 
single-particle energy eigenvalues e t , < 2 , «a,..., are proportional to the 
quantum number: = jS, where 8 is a constant (as for a harmonic 
oscillator with proper choice of energy zero). Let us use the canonical 
ensemble partition function in the form (Eq. 1-37) of a sum over energy 
levels instead of states: 

Q = £ n<e-* </kr . (22-4) 

i 

Consider, say, the system energy level E — 98. That is, 9 units 
(1 unit = 8 ) of energy are to be divided up among the three particles. 
The possibilities are listed in Table 22-1 (Problem 22-2). The numbers 
under a, b, and c are particle quantum numbers j for the respective particles; 
the sum of the j values is 9 in every case (E = 98). The first row cor¬ 
responds to the system energy eigenfunction ^ = ^ 7 (a) 'h(b) ^i(c), where 
(a) refers to the coordinates of particle o, etc. This function satisfies the 
Schrddinger equation for the system, 3C^ = 98^. The number 3 under 
“permutations” refers to the fact that the first row might have read 1, 7, 1 
or 1,1, 7 as well as 7, 1,1 [the first of these corresponds to the system 
energy eigenfunction f t (a) ^ 7 ( 6 ) ^i(c), etc.]. In the second row, there 
are six different functions possible for the set of quantum numbers 6 , 2 , 1 ; 


Table 22-1 


System of Three Particles 


Row 

a 

b 

■ 

Permu¬ 

tations 




1 

3 




1 

6 




1 

6 




2 

3 

5. 



1 

3 

Mm 


3 

2 

6 

mm 

3 

3 

3 

1 
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etc. So far we have ignored symmetry. If we continue to do so, and this 
would be correct if the particles are distinguishable (as in an Einstein 
crystal, for example), we deduce, by adding up the column of numbers 
under “permutations,” that there are 28 different wave functions which 
satisfy the equation = 95^. That is, the system energy level E = 95 
has a degeneracy Qd = 28 (D = “distinguishable”). Thus one term in 
Eq. (22-4), for this simple example, is 28e~ wkT . 

Suppose the three particles obey Fermi-Dirac statistics (e.g., they 
might be electrons). If we nm through the list of 28 energy eigenfunctions 
referred to above, we find that none of them is antisymmetrical. For 
example, 

ie(a) Mb) fiic) * —Mb) M«) 'hie), 


where we have exchanged the coordinates of particles a and b. But we 
have not really exhausted all the possibilities. Any linear combination of 
two or more of the 28 functions is also a solution of the (linear) equation 
K>f/ = 9 8 rfr. On examining linear combinations, we find, for example, 
that the function 



M<*) 

Mb) 

Mb) 

+ = C 

M«) 

Mb) 

Me) 


lh(«) 

Mb) 

(e) 


(22-5) 


where C is a normalization constant, meets our requirements. This function 
satisfies the equation X\f/ = 95^ because it is a linear combination of the 
6 functions corresponding to row 2 in Table 22-1 (as can be seen on 
expanding the determinant). Also, it is an antisymmetrical wave function, 
for to exchange, say, (a) and ( b ), amounts to exchanging two columns in 
the determinant. But it is well known that the exchange of two columns 
changes the sign of a determinant. In this way, (22-5), we can construct 
one antisymmetrical energy eigenfunction for the system from each of 
rows 2, 3, and 6 in Table 22-1 (the quantum numbers are all different 
in each case). No such function can be formed from any of the other 
rows, however. For example, the determinant constructed from the set 
of quantum numbers 7,1,1 has two rows exactly alike and therefore is 
identically equal to zero. This will be the case whenever two or more particles 
are in the same quantum state. Further inspection reveals no other anti¬ 
symmetrical linear combinations of the 28 functions (except linear com¬ 
binations of the three determinants already found). Therefore, in this 
example, there are 3 antisymmetrical system energy states belonging to 
the eigenvalue E = 95; that is, Gfd = 3. Thus one term in Eq. (22-4), 
for a Fermi-Dirac system, is 3e~ 9ilkT . 
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The above argument can obviously be generalized to a system contain¬ 
ing any number N of noninteracting particles. We reach the general con¬ 
clusion that a quantum state of a Fermi-Dirac system in which two or 
more particles are in the same particle quantum state is inaccessible. An 
alternative statement is that, in Fermi-Dirac statistics, each particle 
quantum state can be “occupied” by either zero or one particle at a time. 
This is just the Pauli exclusion principle (which forms the basis of the 
periodic table when applied to electrons in atoms). For example, when 
the above system of 3 particles is in the state (22-5), particle states 1, 2, 
and 6 are occupied (we cannot say by which particle), while states 3, 4, 5, 
7, 8, ... are unoccupied. 

Next, we consider the Bose-Einstein case (e.g., the three particles 
might be He 4 atoms). We examine the 28 wave functions above and find 
that only the last one, 

t — Mo) Mb) Me), 


is symmetrical. But, if we allow linear combinations, we find that in fact 
one symmetrical wave function can be formed from each of the rows in 
Table 22-1. All these functions can be represented in the form of a modi¬ 
fied determinant; for example, from row 1 (Problem 22-3), 


t = C' 


Mo) Mb) Me) ' 
lh(«0 Mb) 'hie) , 
'h(o) tiib) Me) 


( 22 - 6 ) 


where the prime on the determinant means that in expanding the de¬ 
terminant all signs are to be taken positive. This scheme gives, for row 7, 

= QC'Mo) Mb) Me), 


etc. Thus, in this example, £2be = 7, and one term in Eq. (22-4) is 

7 e - 9 «/*r 

From the above simple example we can make the following general state¬ 
ment: in Bose-Einstein statistics (symmetrical wave functions), the number 
of particles in a particle quantum state is unrestricted; the number can 
be 0,1,2,3,..., N (whereas in Fermi-Dirac statistics only 0, 1 are 
possible). 

There is another important general deduction we can make by inspection 
of Table 22-1. We note that, in this example, 
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It is easy to see that this will always be true. Each Fermi-Dirac state 
(rows 2, 3, and 6 in Table 22-1) corresponds to Nl “distinguishable” 
states. But there may be some sets of particle quantum numbers (rows 
1, 4, 5, and 7 in Table 22-1) which give “distinguishable” states but not 
a Fermi-Dirac state. Therefore Qd > Nil 2fd- Also, each set of particle 
quantum numbers gives one Bose-Einstein state. The maximum number 
of “distinguishable” states for one set of quantum numbers is Nl (all 
quantum numbers different). Therefore Qd < AHQbe- 

It is also clear that if E is sufficiently large (high temperature), the 
number of available particle quantum states can be very large compared 
with the number of particles N (see Sections 3-2 and 4-1). In this case the 
vast majority of sets of particle quantum numbers will have all quantum 
numbers different (i.e., because of the large excess of particle quantum 
states, only infrequently will two particles be in the same quantum state, 
even if allowed). Thus, for large values of E, the equalities in (22-7) 
are approached: 

be —* <— Ofd (E —» oo). (22-8) 

Hence, the two kinds of statistics give the same limiting result. This 
limiting type of statistics (treating the particles as distinguishable to get 
0 d, and correcting with Nl) is referred to as “classical” or “Boltzmann.” 
We have been using Boltzmann statistics in most of the book, since the 
limiting condition is achieved in the usual problems of interest. 

The canonical ensemble expression corresponding to Eq. (22-8) is 

£ Q* E e~ EilkT Q?e~ B < lkT *- £ aJ D e~ E ‘ lkT . (22-9) 

Actually, this relation and (22-8) are not restricted to systems of non¬ 
interacting particles (see Sections 6-2 and 22-6). 

We turn now to the explicit problem of calculating thermodynamic 
functions for a one-component system of noninteracting indistinguishable 
particles. The customary treatment* is based on fl and the microcanonical 
ensemble. Table 22-1 makes it clear how one is to proceed in principle. 
For given N and E, Gbe is the total number of different ways of dividing 
E up among the N particles, treating the particles as indistinguishable 
and without restricting the number of particles in any one particle 
quantum (energy) state. For £1fd, however, we have the limitation that 
not more than one particle can be in a particle quantum state (hence 
J2fd < 8be)- In practical cases, N and E are of course very large (macro- 


* See, for example, Mayer and Mayer, Chapter 5. 
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scopic). The restriction of fixed values for N and E requires use of unde¬ 
termined multipliers (Problem 22-4), or the method of steepest descents. 
A simpler procedure is to employ the grand partition function (ji, V, T 
specified). This avoids the restraints of constant N and E. 

In Chapter 7 [see, especially, Eqs. (7-26), (7-31), and (7-34), and the 
discussion following Eq. (7-27)], we found that a grand partition function 
£ for a single subsystem was useful. Specifically, we considered a system 
in which the adsorption sites were distinguishable and the particles in¬ 
distinguishable. In Eq. (7-31) (Langmuir adsorption), the number of 
molecules per site was restricted to zero or one, while in Eq. (7-34) (B.E.T. 
adsorption) the number of molecules per site was not restricted. The 
situation here is very similar. A particle quantum state with the particles 
occupying it is a subsystem in the present problem. The particle quantum 
states are distinguishable, but the particles are indistinguishable. In 
Fermi-Dirac statistics, the subsystem has s = 0 or 1, and in Bose-Einstein 
statistics s = 0,1,2,- 

In Fermi-Dirac statistics, then, 


E(m, V, T) = II ZAV, T), £y = 1 + e-*> (K)/tr X, X = e' lkT - 
’ ( 22 - 10 ) 

The first term in £y is associated with sy = 0, and the second with Sj = 1, 
where e—'i ,kT [corresponding to q in Eq. (7-31)] is the partition function 
of one particle in the energy state «/. The mean number of particles in 


3 



Fra. 22-1. Bose-Einstein, Fermi-Dirac, and classical (Boltzmann) distribu¬ 
tions: mean number (I) of particles in a quantum state as a function of the 
energy (<) of the state. 
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Fig. 22-2. Fcrmi-Dirac distribution at different temperatures, 
the system is 

<22 - n > 

where 

s . _ x = e ~' i,kT * = _L_ • ( 22 - 12 ) 

3 V ax ) v.T 1 + e-* lkT \ e ( v- w/tr + ! (22 12) 

This is the Fermi-Dirac distribution law (see Fig. 22-1). 

Equation (22-12) gives the mean number of molecules in the state 
For any finite (j and constant T:iip —* — oo, 5/ —» 0 (therefore the system 
approaches zero density, TJ/V —* 0); and if p —» +oo, 5y —* 1 (the system 
approaches infinite density unless there are only a finite number of particle 
energy states). For finite €;, constant p, and T —* 0 (see Fig. 22-2): 
Sy —► 0 if > p and 5/ —* 1 if €,• < p. Thus, at the absolute zero of 
temperature, the particles fill up (one per state) all energy states below 
€ — p and leave all states above e = p completely unoccupied. Because 
the particles cannot all go into the ground state at T = 0, a Fermi-Dirac 
system necessarily has a large zero-point energy. (If p is chosen less than 
«!, the ground state, all 5,- = 0 and Tf = 0 at T = 0.) For T slightly 
above absolute zero, the distribution is no longer a step function, and 
there is partial occupation (0 < J,- < 1) of states in the neighborhood 
of € = p (Fig. 22-2). 

We noted in connection with Eqs. (22-8) and (22-9) that both kinds 
of quantum statistics are expected to go over into classical statistics if 
the number of available quantum states greatly exceeds the number of 
particles, N. In the present context and notation, this means Sy —* 0 
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for all energy states (i.e., sy = 0 almost always, but occasionally 8y = 1). 
We see from Eq. (22-12) that this condition is achieved if X —* 0 (e.g., 
7J/V —> 0 with T constant or T —* +oo with TJ/V constant). Equation 
(22-12) becomes, for small X, 

5 y = \e- , i' kT (X -> 0), (22-13) 

or 

—«-IkT 

(22-14) 

where X has been eliminated (X = N/q) by summing (22-13) over j. The 
quantity Sy/17 is the fraction of molecules in state j, or the probability 
of any one particular molecule being in state j. Equation (22-14) is the 
Boltzmann distribution, the same as Eq. (3-35), as should be expected. 
We observe that q and the Boltzmann probability e~‘i lkT /q arise only in 
the limit X —> 0. Figure 22-1 includes the classical or Boltzmann case, 
(22-13). 

In Bose-Einstein statistics. 

3(m, K.n-IlMFJ), (22-15) 

J 

*y - 1 + e-' (V)/M X + e-*V<v>/*V + • • • =- l ~ ir ■ 

1 — e »' X 

(22-16) 

The successive terms in the above series are for *y = 0, 1, 2, ..., the 
number of molecules in state j, with energies 0, €y, 2ey, ..., and partition 
functions 1, e~'i lkT , e~ 2 'i lkT , etc. The series converges only if e~'f lkT \ < 1. 
Since this condition must hold for all states j, we have that n must be 
less than the ground state energy e x (usually taken as zero for conven¬ 
ience). There is no corresponding restriction on values of y in Fermi- 
Dirac statistics. Equation (22-11) for TJ still applies but here we find 

■'-‘(TT-'X.r'e^krr (22 - 17 > 

This is the Bose-Einstein distribution law, which differs from the Fermi- 
Dirac law only by the sign in front of the unity in the denominator. 
Figure 22-1 shows that for a given y < < t , each energy state ey has a 
mean population which decreases in the order BE, Class, FD. The result 
BE > FD is to be expected simply because of the restriction «y = 0 or 1 
for FD. Relative to the classical case, the larger (smaller) number of 
particles in the system for given y in the Bose-Einstein (Fermi-Dirac) 
case corresponds to an effective attraction (repulsion) between the parti¬ 
cles. We shall see this also in Eqs. (22-38) and (22-76). 
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In a Bose-Einstcin system, for any p < e u when T — * 0 all 5,- —♦ 0 
and 37 —* 0. However, if p is only infinitesimally less than and T —* 0, 
then we still have S>• —* 0 for all j > 1, but Si can be made as large as 
one pleases. Thus if we put e t — p = kT/U , 

h = ■ ,y_ ~ =37 (T —> 0). (22-18) 

In other words, in the limit T —» 0, the only thermodynamically significant 
(37 > 0) value of p is p = ti ; also, in this limit, all particles are in the 
ground state («i) and all excited states (j > 1) are unoccupied. This is 
in distinct contrast to the behavior of a Fermi-Dirac system at T = 0. 

The low-temperature behavior for either kind of statistics also follows 
from Fig. 22-1 if we keep in mind that as T —► 0 the horizontal energy 
scale in the figure shrinks proportionally. 

If we let X —» 0 in Eq. (22-17), we have 

= Xe ~' llkT - °)- (22-19) 

This is again the classical or Boltzmann distribution, (22-13), as it should 
be. Each 3j <SC 1 : usually gj — 0, occasionally a s — 1, and only very rarely 
is sj > 1. 

Finally, let us indicate in a formal way how the various thermodynamic 
functions are obtained in either kind of quantum statistics. The variables 
Ii, V, and T are independent and specified in advance. Then, from Eq. 
(22-11), the mean number of particles in the system is (upper sign FD; 
lower sign BE) 

^ ± r 

The average energy is obviously 

Also, for the equation of state, we have 

pV = kT In S = ±kT £ In [1 ± e < — y)/tI ’]. (22-22) 

i 

In these equations the tj are functions of V (or of the external variables 
of the problem). All other thermodynamic functions can now be found 
from p, V, T, 37, H, and p (Problem 22-5). 

We consider the three most important special cases in the following 
sections. 
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22-2 Ideal Fermi-Dirac gas; electrons in metals. The particular model 
we consider here is a noninteracting gas of Fermi-Dirac particles in a box 
of volume V (ideal Fermi-Dirac gas). The energy levels are then those 
of a particle in a box, Eq. (4-2). Because the energy levels are extremely 
close together, for macroscopic V, we can treat them as continuous. 
Equation (4-9) gives the number of translational states between e and 
e -f- dc. 

For concreteness, we discuss one particular example—the electrons in 
a metal. In this case there is an electron spin of one-half (in units of 
h/2rr) and degeneracy of two. We therefore multiply Eq. (4-9) by two and 
have 

«(«) de = 4ir * Ve U2 de. (22-23) 


If an electron at rest in the gas phase is used to locate the zero of energy, 
then each electron in the metal is assumed to move in a constant potential 
field <p (compare <p in Chapter 16). This potential is negative (of the order 
of several electron volts), owing to the attraction between an electron and 
the lattice of positive ions, and goes to zero at the surface of the metal. 

It is at first glance quite surprising that a noninteracting gas of electrons 
is of any use at all as a model for the electrons in a metal. The fact is 
that the deductions from this model are in good qualitative (not quanti¬ 
tative) agreement with experiment (heat capacity, electrical and thermal 
conductivity, thermionic emission, etc.). The essential reasons for this 
agreement are: (1) each electron interacts with the other electrons and 
the lattice of positive ions by long range coulomb interactions which 
change only relatively slowly with the position of the electron (hence the 
use of <p = constant); and (2) as we shall see below, quantum effects are 
extreme in this case, and these (rather than interaction) dominate in the 
behavior of the electrons. 

For simplicity, from here on we take (but see Problem 22-6) the zero 
of energy as the ground state in the metal, ti = 0 (instead of «i = <p). 
Then Eq. (22-20) becomes 


77 = 



e m de 

e (.-M)/*r + x • 


(22-24) 


Let us check immediately to see if the classical limit (omit the +1) can 
be used. This requires X « 1. We can use the classical (ideal gas) X for 
this test, Eq. (4-23) (the electron spin introduces a factor of two, which 
is unimportant for the moment). That is, if X c i a »» 1 then X = X 0 i M8 . 
Now the condition « 1 from Eq. (4-23) is just the condition 
A 3 N/V « 1 in Eq. (4-6). Let us take m = electron mass, T = 300°K, 
and N/V corresponding to 10 cm 8 *mole -1 for the metal and one free 
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electron per metal atom. We find 



which is large, rather than small, compared with unity (see also Table 4-1). 
Thus quantum effects are large here; this is because of the high free-electron 
density and, especially, low mass. (The term “strong degeneracy” is often 
used in both kinds of statistics to refer to the case X c i a8 a » 1; “weak 
degeneracy” means almost classical, X e uu < 1.) One can use the classical 
limit of Eq. (22-24) only above about T = 10 5o K (Problem 22-7), at 
which temperature, however, the metal would be vaporized. 

Thus, from the point of view of the electrons in a metal, room tempera¬ 
ture is itself a relatively low temperature (see Fig. 22-2). In fact, equations 
valid in the limit T —> 0 give numerical results that are quite accurate 
(as we shall confirm below) at room temperature. Therefore we begin by 
investigating the low-temperature limit. 

When T = 0, 8 = 1 for e < p and 5 = 0 for « > p, as already pointed 
out. Equation (22-24) becomes, then, 

* - *(&)” - w 


The number of electrons that can be accommodated by the system is 
clearly just equal to the number of quantum states with energy p or less 
[compare Eq. (4-5)]. Or, if we regard N/V as given, all energy states up 
to the so-called Fermi energy, 


M = 



(22-26) 


are filled, and all above p are empty. The Fermi energy is of the order of 
several electron volts (Problem 22-8). 

The (kinetic) energy at T = 0 is 


E = 



de 


ZNjx 

5 


(22-27) 


That is, the average energy (ground state as zero) per particle is 3/x/5. 
This is the zero-point energy of the system. It is large because of the Pauli 
exclusion principle. If we use Eq. (22-26) to eliminate p from Eq. (22-27), 
we have H as a function of 77 and 7. We see, then, that CV = 0 (T —* 0). 
An early problem in the (classical) theory of metals was to try to reconcile 
the fact that although “free” electrons exist as far as thermal and electrical 
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conductivity are concerned, they still do not contribute significantly to 
the heat capacity (the expected value was Sk/2 per electron). We have 
here the quantum-mechanical explanation: at ordinary temperatures the 
vast majority of electrons are in the filled states, e < p; the only electrons 
that have higher energy levels available (i.e., unoccupied) to which they 
can jump (and absorb heat) are those near e — /it. The first nonzero 
term in CV, expanded in powers of T, is given below in Eq. (22-31). 

We can find the pressure from Eq. (22-22). When m > « (and T —> 0), 
it is permissible to drop the unity. When p < e, we drop the exponential. 
Thus 

pV = kT^ (^i) «(e) ds = 27 M - £ 

= $Np = (22-28) 


The electrons in a metal exert a very high pressure (even at T = 0, 
because of the large zero-point kinetic energy), of the order of 10® atm. 
This pressure is balanced, however, by a potential energy contribution 
from dv/dV, which we have not included explicitly. 

We digress here to note that the result pV = 2Z?/3 is rather general: it 
holds for any noninteracting (ideal) gas of particles in a three-dimensional 
box, irrespective of statistics or temperature. To see this in a formal way, 
put «(e) de — Ce 112 de (Eq. 4-9) in Eqs. (22-21) and (22-22): 


E 


pV = ±kTC 


= cf 

Jo 

r 

Jo 


" < 8/2 de 

e <-»' kT d= 1 ’ 

e 1,2 ln [1 ± e ( '- )/ir ] de. 


An explicit expression for C is not needed here. An integration of the pV 
equation by parts leads immediately to pV — 2E/Z. The physical 
explanation of this general result has already been given in connection 
with Eq. (4-19). 

For the entropy of a Fermi-Dirac gas at T = 0, we have 

S = 0, (22-29) 


since there is only one way (ft = 1) to put 27 indistinguishable particles 
in the lowest 27 energy states, one per state. For T > 0, S is proportional 
to T (see Eq. (22-31)]. 

The above equations for the low-temperature limit are the most im¬ 
portant for this simple model of a metal. To find the next term in expan¬ 
sions in power of T requires straightforward but quite lengthy mathe¬ 
matical manipulations, which we omit because the results obtained are 
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not sufficiently important for our purposes. (We shall see below that the 
corrections found are rather negligible.) Let us simply state, as a typical 
example, the result for the energy: 


+-]. <»*» 

where p 0 is the Fermi energy: 



This gives E as a function of the independent variables N, 7, T (closed, 
isothermal system). Since /x 0 is of the order of electron volts and 1 ev 
corresponds to kT with T — 1.16 X 10 4o K, we see that the correction in 
Eq. (22-30) at, say, T = 300°K, is of the order of only 0.1%. 

Other thermodynamic functions follow easily from Eq. (22-30) (Problem 
22-0). For example, 



My f- + 

* Po 


(22-31) 


This predicts that Cv is proportional to T, a conclusion which has been 
verified at low temperatures (where the Debye T 3 heat capacity of the 
metal—arising from vibration of positive ions—goes rapidly to zero). 
Also, integration of Cv/T leads to S « T. At room temperature, Eq. 
(22-31) gives, in agreement with experiment, a heat capacity which is of 
the order of only one percent of (a) the classical value for free electrons, 
or (b) the vibrational heat capacity of the positive ions (Chapter 6). 
Figure 22-3 shows the heat capacity of an ideal Fermi-Dirac gas over 
the whole temperature range. 

A much more detailed account of this subject is available in a number 
of places. See especially Mayer and Mayer, Chapter 16. 



Fio. 22-3. Heat capacity of ideal Bose-Einstein and Fermi-Dirac gases. To 
is defined in both cases by Eq. (22-45). 
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22-3 Ideal Bose-Einstein gas; helium. We saw in the preceding section 
that very strong quantum effects occur in an electron gas even at ordinary 
temperatures. In this section we shall be concerned with a gas of non¬ 
interacting Bose-Einstein molecules (ideal Bose-Einstein gas). The mass 
of a particle (molecule) here is several thousand times the mass of an 
electron. Hence quantum effects are much smaller: they are absent at 
room temperature and become very strong only at very low temperatures 
(a few degrees absolute). We found, for example, in Table 4-1 that X -i... = 
A 3 N/V —1.5 for helium at the liquid density and 4.2°K. The condition 
for classical behavior, it will be recalled, is X 0 i aM <5C 1. We conclude from 
the above remarks that in the present problem (and of course also for a 
Fermi-Dirac gas of molecules) we have to consider both weak and strong 
quantum effects. Low-temperature expressions (strong degeneracy) are 
not valid over the whole range of practical interest, as is the case with 
electrons in a metal. 

The specific example we choose is helium. Helium is almost the lightest 
gas available and it has the weakest known intermolecular forces. Hence, 
with helium, quantum effects have the best possible chance to make them¬ 
selves evident in the gas at low temperatures without intermolecular forces 
confusing the issue. Unfortunately, it turns out that the extremely in¬ 
teresting condensation properties of helium, observed experimentally, are 
due to an inextricable mixture of quantum and intermolecular force 
effects. But there is little doubt that the “condensation” in an ideal Bose- 
Einstein gas, which we shall study below, is intimately related to the actual 
behavior of helium at low temperatures, though it furnishes only part 
of the story. 

Let us begin by investigating the region of slight degeneracy (low 
density or high temperature). This is most simply done using essentially 
the grand-partition-function, virial-expansion method of imperfect gas 
theory (Chapter 15). The question of the range of convergence of the series 
will be examined later. We shall write explicit expressions only through 
the second virial coefficient and related terms, but it is easy to extend the 
various series further [see, for example, Problem 22-10 and Eq. (22-89)]. 
Up to the second virial coefficient there is no inconvenience in including 
both Fermi-Dirac (upper sign) and Bose-Einstein (lower Bign) cases in 
the same equations, so we may as well do this. 

We begin with 

pV = kT InS 


= ±2 irkT (|r) 3 2 vj In (1 ± e~' lkT \) de. (23-32) 
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For small X we can expand the logarithm and integrate term by term: 

pV = =F2 vkT (|f 2 V 2 jf e m e~ i,lkT de, 

or 

JL_ x 1 V (=F1)V 

fcT A3 3 m (22-33) 

In the limit of zero density (X —* 0), p/fcT = X/A 3 = p (classical ideal 
gas; Chapter 4). As usual, we define the activity z so that z —* p when 
p —♦ 0. That is, z ss X/A 3 . If we replace X by 2 in Eq. (22-33), we have 


where 


P_ 

kT 


£ HTW, 

tel 


bj(T) 


(=Fl) y-1 A 3U-1> 

jSI2 


(22-34) 

(22-35) 


Equation (22-34) is in the same form as Eq. (15-7) of imperfect gas theory. 
While those departures from the ideal gas law (p/kT — z = p) that were 
discussed in Section 15-2 were due to intermolecular forces in a classical 
gas, here they are due to quantum effects. Because Eq. (22-34) is a special 
case of Eq. (15-7), we can use the general methods of Chapter 15 without 
modification: Eqs. (15-7) through (15-21) are all applicable if we take 
Eq. (22-35) for the bj. Also, we have 

Wkf = 2 ~i*T = 2[ 1 + 5 1 B * +l(:r)p ] (22-36) 

and (see also Problem 22-11) 

S _ ~E _p _ p_ 

M ~ 7JkT + pkT kT 

= | - In A 3 p + g (| - ±±1^ B i+1 p k . (22-37) 

Next, we list a few explicit equations (weak degeneracy) through the 
second virial coefficient, using B 2 — —b 2 = ±A 3 /2 5/2 (upper sign, 
Fermi-Dirac): 

pkT ~ 1 =fc 26/2 P + ''' ’ (22-38) 

lnA3 P±^72p + ---’ 


(22-39) 
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JE_ _ 3 
FAT 2 


(‘ 



S_ = 5 

fa 2 


lnAp ± + 


Cv = 3 
FA 2 



A* 

27/2 P + 



(22-40) 

(22-41) 

(22-42) 


The following comments on these equations may be of interest, (a) The 
sign of f? 2 corresponds to a “repulsion” between Fermi-Dirac molecules 
and an “attraction” between Bose-Einstein molecules, (b) The magnitude 
of the purely quantum second virial coefficient above is B$ — 0(A 3 ). 
The magnitude of a classical B 2 arising from intermolecular forces is 
B% = O(l/pcond), where pcond is the number density of a condensed phase. 
The ratio of the two magnitudes is then = 0(A 3 pcond). But this 

is just the order of the quantity calculated in Table 4-1; hence, we have 
a verification of the fact that in helium, at about 4°K, quantum and inter¬ 
molecular effects both contribute significantly to B 2 . (c) If intermolecular 
forces are suddenly “switched on” in a classical monatomic ideal gas 
(p and T constant), the entropy decreases whether the forces are repulsive 
or attractive (Problem 15—11). But note in Eq. (22-41) that although a 
Bose-Einstein gas has a lower entropy than a classical gas, a Fermi-Dirac 
gas has a higher entropy. This shows that the analogy between quantum 
effects and intermolecular forces referred to in (a) cannot be pushed too far. 
(d) These series may be regarded not only as in powers of p (coefficients 
are functions of T) but also as in powers of A 3 p (coefficients are constants), 
powers of 7’~ 3/2 (coefficients are functions of p), or powers of h 3 (coefficients 
are functions of p and T). (e) Equation (22-42) informs us that the Cv 
curves in Fig. 22-3 deviate from the classical value at high temperature 
by a quantity proportional to T~ 312 . 

The above series are valid for small X. Now let us examine what happens, 
in Bose-Einstein statistics (the complications we find below do not arise 
in Fermi-Dirac statistics) when we take larger values of X. First, we note, 
from Eq. (22-17), that the number of molecules in the ground state 
(< I = 0) is 3i = X/(l — X) and hence that X can never be greater than 
unity (p cannot be positive). We are interested, then, in the range 
0 < X < 1. Next, let us look at the integral form of the equation for F: 





We multiply numerator and denominator of the integrand by e~ ,lkT \, 
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expand (1 — e~ ,lkT \) \ integrate term by term, and find 

jj = (^iy 12 VF m (a), (22-43) 

where 



and the function F,(a) is defined by 

F.(a) = Ei'V*, 

’~ l (22-44) 

f.(o) = Y,r a = m, 

i— l 


where f(<r) is the Riemann f-function. Since p cannot be positive, a 
cannot be negative. As X increases from 0 to 1, a decreases from +oo to 0. 
In order to study increasing degeneracy, then, we can start with large a 
and let a decrease toward zero. In this process the function F 3 / 2 (a) 
increases monotonically (something like e~“, but more rapidly for small 
a) from a value zero at a = +oo to a value f(3/2) = 2.612 at a = 0. 
Thus, for given V and T, TV increases from zero at a = +w to a maximum 
value 2.6127/A 3 at a = 0. But this result is inconsistent with the rela¬ 
tion Si = X/(l — X), which implies that Si and hence 77 (since 77 > S x ) 
can be made arbitrarily large by taking X arbitrarily close to unity. The 
reason for the discrepancy is that in converting the sum (22-20) to an 
integral, we have incorrectly omitted a contribution from the ground 
state, because the weight «(e) « e 1,2 is zero at e = 0. 

Let us now change our point of view a little and suppose that N/V = p 
is fixed, and that Eq. (22-43) determines a as a function of T. This equa¬ 
tion is valid so long as a is significantly greater than zero or X is sig¬ 
nificantly less than unity, for in this case the number of molecules in the 
ground state, X/(l — X), is of negligible order compared with N. For 
example, if a — 10~°, X = 1 — 10 -6 , and Si = 10®, which is negligible 
in relation toiV. At high temperatures, a is large and Eq. (22-43) is equiv¬ 
alent to the series (22-39) (lower sign). As T decreases, a decreases 
until the limiting value a = 0+ (or p = 0—) is reached at a special 
temperature T 0 (p) defined by the equation 


( h 2 y /2 

\2irmkT 0 ) 


f(3/2) = 2.612 
P P 


(22-45) 


At this temperature there is apparently some kind of singularity. Hence, 
for given p, Eq. (22-43) and the series (22-38) through (22-42) are valid 
for T > T 0 , but not for T < T 0 - Alternatively, we can say that, for 
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given T, these series are valid for p < p 0 (T) but 
po is defined by 



not for p > po, where 


(22-46) 


The numerical value of To turns out to be 3.13°K at the density of 
liquid helium (27.6 cm 3 -mole -1 ). This is of the same order of magnitude 
as the temperature of the (higher-order) phase transition in helium (re¬ 
ferred to again below), 2.19 S K. Closer agreement is not to be expected, 
since intermolecular forces are being neglected in the present discussion. 

For T < T 0 , and N and 7 fixed, the right side of Eq. (22-43) gives 
the number of molecules in excited states. Then N is equal to this number 
plus the number in the ground state: 


N = 


1 , (2vmkT\ m 

e“ — 1 + V *2 / 


VF m (cc). 


This equation has a solution for a if a is only slightly greater than zero: 

*-:+(“r "*'■<«+> - i +"ty”' 

where we have eliminated 7 by use of Eq. (22-45). Then 

^ = - y = *! = A^ 1 - QQ 8/2 ] W < To). (22-47) 

This verifies that a and p are essentially zero in the whole range 0 < T < 
To'a — 0(N~ l ) and —p = 0(kT/N). AtTSO,-p= kT/N, in agree- 




Fig. 22-4. Number of molecules in the ground state as a function of (a) tem¬ 
perature (density constant) and (b) volume per molecule (temperature constant), 
for an ideal Bose-Einstein gas. 
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ment with Eq. (22-18). Figure 22-4(a) shows how the number of molecules 
Si in the ground state varies with T. Above T 0 , Si is a thermodynamically 
negligible number, but starting suddenly at To (as T is decreased), there 
is a "condensation,” called the “Bose-Einstein condensation,” of a macro¬ 
scopic number of molecules into the ground state. At T = 0, all molecules 
are in the ground state: Si = N. 

Alternatively, for p > po(T) with T constant, we have from Eqs. 
(22-45) and (22-46), 

Si = N (l - (p > p 0 ), (22-48) 


as shown in Fig. 22-4(b). 

We shall return to the Bose-Einstein condensation following investiga¬ 
tion of other thermodynamic properties. The equation for the energy of 
a Bose-Einstein gas is 


T r f. 

-MW 


c 3/a dt 
e t,kT \~ l — 

VF m (a) 


1 


(22-49) 



F 3 / 2 W 


(22-50) 


For T > T 0 (p constant) or p < p 0 (T constant), Eq. (22-50) is equiva¬ 
lent to Eq. (22-40) [where a has been eliminated as an independent 
variable in favor of p, using essentially Eq. (22-39)]. For T < To, the 
macroscopically significant number of molecules in the ground state [not 
taken care of by the integral in Eq. (22-49)] do not contribute to the 
energy anyhow, since «i = 0. Therefore Eq. (22-49) also holds for 
T < T 0 if we put a = 0+: 


E = 



(T < To). 


(22-51) 


The numerical value of f(5/2) is 1.342, and ?(5/2)/f(3/2) = 0.5134. We 
note that E « T il2 , for T < T 0 . The heat capacity is 


C v = 




j( 5/2) 
f(3/2) 



(T < T 0 ), (22-52) 


or Cv * T 312 . The complete Cy from Eqs. (22-42) and (22-52) is shown 
in Fig. 22-3. There is no discontinuity in Cv at T 0 , but there is a dis- 
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Fig. 22-5. Experimental beat capacity of liquid helium under its own vapor 
pressure. 

continuity in dCv/dT. The experimental heat capacity of liquid helium 
under its own vapor pressure (neither Cv nor C p ) is shown in Fig. 22-5, 
with a singularity at 2.19°K. Because of the shape of this curve, the 
singularity is often referred to as a X-transition. Experimental curves of 
this kind are also found in the order-disorder transition in alloys (Sec¬ 
tion 20-1). 

The equations for pV follow immediately from pV = (2/3)1?. We 
already have the case T > T 0 or p < p 0 in Eq. (22-38). This can be 
deduced from Eqs. (22-50) for E and (22-39) for a (or p). For T < T 0 
or p > p 0 , we have from Eq. (22-51), 

py - »«■ (fT < t < <zm > 

or 

§T - <0 > <*>• ^ 

Equation (22-54) is particularly important, for it states that if T is held 
constant, p remains constant in value for p > po- The p-v isotherms are 
shown in Fig. 22-6. 

It should now be clear that, thermodynamically, the Bose-Einstein con¬ 
densation is a first-order phase transition (but the experimental transition 
in helium is not first order). The two phases in equilibrium are the con¬ 
densed phase with T, 1/p = v — 0 and the dilute .phase with T, 
v = l/p 0 (T). The two phases have the same pressure (the vapor pressure), 
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P 



Fig. 22-6. Pressure-volume isotherms for an ideal Bose-Einstein gas. 

p = 0.514fcTp 0 (r), and the same chemical potential, p = 0. How¬ 
ever, this first-order phase transition is very unusual on the molecular 
level. When 0 < v < l/p 0 (T), that is, in the flat portion of a p-v isotherm 
(Fig. 22-6), the system has a uniform macroscopic density l/t> (instead 
of the customary two different densities). As it is usually stated, the 
condensation occurs in momentum space rather than in coordinate space: 
the condensed phase consists of molecules with zero energy and momen¬ 
tum, and macroscopic de Broglie wavelength. (Actually a zero-point 
energy and momentum exist for a molecule in a box, but they are com¬ 
pletely negligible in magnitude since V is macroscopic.) 

If T < To or p > p 0 , the entropy is given by 

» E + pV-Np 

^ “ rp 

E + pV = H = 5E 
T T 3 T 

iT<T ° ) ■ ^ 

Hence S « T 3/2 and S = 0 at T = 0 (all molecules are in the ground 
state). For the heat of the phase transition, per molecule, we then find 





22-4] BLACKBODY RADIATION (PHOTON GAS) 453 

[replace (T/T 0 ) 312 by p 0 /p in Eq. (22-55)] 

AH _ TAS _ T(Sdn Scond) 

N ~ N ~ N 

_ T[S(p = po) - S(f> = oo)] _ 5 f(5/2) 

N ~ 2 f(3-2)' ( 56) 

For further details the reader is advised to consult the treatment by 
F. London (Supplementary Reading list). 

22-4 Blackbody radiation (photon gas). Here we give a brief derivation 
of the thermodynamic properties of electromagnetic radiation in thermal 
equilibrium, which we regard as a gas of photons. We consider the photons 
to be in a container of volume V with heat insulating and perfectly re¬ 
flecting inside walls. Thus the system is isolated and has a definite energy 
E. Because photons do not interact with each other, a very small black¬ 
body is assumed to be present in the container to absorb and emit photons, 
thus making thermal equilibrium possible. Because photons absorbed at 
one frequency might be emitted at another, and in different numbers to 
conserve energy, the total number N of photons in the system is not fixed, 
even though the system is isolated. 

There are two essential respects, then, in which a photon gas differs 
from a gas of particles with nonzero rest mass (e.g., electrons or helium 
atoms): (1) the photons do not interact with each other, so that in the 
photon gas no approximation is made by neglecting interparticle inter¬ 
actions; and (2) the number of particles in an isolated system is not 
conserved. 

Thermodynamically, the system is characterized by E and V only 
(not E, V, and N, as usual for an isolated system). 

Photons have spin one (units of h/2w) and hence obey Bose-Einstein 
statistics. The spin degeneracy is two (corresponding to two independent 
directions of polarization); it is not three, as would be the case for particles 
with spin one and nonzero rest mass. 

Electrons also have a spin degeneracy of two, so we might expect to be 
able to take over Eq. (22-23) for w(e) dt, the number of energy eigenstates 
between € and e + dt, for a single particle in V. However, because of their 
zero rest mass, Eq. (22-23) is not applicable to photons. Instead, we write 
this equation in terms of the absolute value of the momentum (c = p 2 /2m, 
p > 0 , since the energy is all kinetic): 

8wVp 2 dp 
A3 


is the number of quantum states with p between p and p + dp. (Note 
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that we use p for both momentum and pressure in this section.) This 
expression does not involve the mass. In applying it to photons, we use 
the de Broglie relation, 

<aw7) 


to again change variables, this time from p to v. This gives 

G(v) dv = (22-58) 

as the number of quantum states with frequencies between v and v + dv. 
Note that Eq. (22-58) agrees with the “transverse part” of Eq. (VI-30) 
for g(v) dv in Debye’s continuum model for a crystal. In fact a wave 
argument of the type presented in Appendix VI is ordinarily used to derive 
Eq. (22-58). We have, instead, taken the particle point of view. This 
illustrates the fact that photons may be viewed either as particles or waves. 

We now wish to apply the relations (22-20) through (22-22) for N, E, 
and pV to the special case of a photon gas. We use the lower sign for 
Bose-Einstein statistics, take Eq. (22-58) as the weighting function in 
converting the sums to integrals, and put 6 = hv for the energy of a 
photon. Also, because of the fact that this (isolated) system is defined 
thermodynamically by E and V only, we can make a special assignment 
of p at the outset. We first consider this point. 

We have already encountered other isolated systems in which the 
numbers of molecules are not fixed: systems with chemical equilibria. Let 
us review this topic briefly (see Chapter 10, especially Section 10-2). 
Suppose we have a two-component system characterized by E, V, Na, 
and Nb- Then a catalyst is added making possible a chemical reaction 
between A and B, say A B. Thermodynamically, this reduces the 
number of independent components by one, from two to one. The equi¬ 
librium point can be located by, for example, maximizing S with respect 
to Na holding E, V, and Na + Nb constant, minimizing F (or A) with 
respect to Na holding p (or V), T, and Na + Nb constant, etc. In any 
case, the equilibrium condition Ap = pa — pa = 0 is found. The situa¬ 
tion in a photon gas is rather similar. The small blackbody catalyzes the 
“reaction” nA *=* ml (n and m arbitrary integers, A = photon), which 
leads to a number of independent components one less than would other¬ 
wise be the case (i.e., the number of independent components here is zero 
instead of one). The equilibrium point is located by maximizing S with 
respect to N holding E and V constant, minimizing F (or A) with respect 
to N holding p (or V) and T constant, etc. In any case, all the relations 
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give fi = 0. We also get ju = 0 from 

A#t = m/x — n/t = (m — n)p — 0. 


In summary: because the number N of photons is not conserved in an 
isolated system, N must be regarded as a parameter and not as an inde¬ 
pendent thermodynamic variable; location of the equilibrium point with 
respect to the value of the parameter leads to the condition /* = 0. 

Whereas a molecular Bose-Einstein gas has = 0 for T < T 0 , a photon 
gas has /t = 0 for all temperatures. 

The remainder of the analysis is very straightforward. Equations 
(22-20) through (22-22) become 


E 


8wF r v 2 dv 

C» Jo T _ j ’ 

(22-59) 

8 vVh v*dv 

(22-60) 

C 3 J 0 e hPlkT _ J ’ 

SvV a kT J In (1 e~ h " kT ) dv. 

(22-61) 


The integrand in Eq. (22-60) is essentially the blackbody energy distribu¬ 
tion function, discovered at first empirically by Planck in 1900, which 
provided the starting point for the whole development of quantum theory. 
If we write this integrand as 


J ,3 g -Ai>/*r 


c 


,h*lkT 


1 1 - e 


-hr/kT 


= v 3 £ (e- hrlkT ) n , 


»>1 


we find 




Since f (4) = t 4 /90, we have 


8 * 6 V(kT) 4 
15 (he) 3 


(22-62) 


Note that E is a function of V and T only; or T is a function of E/V only. 
The fact that the total energy of the radiation is proportional to T 4 is 
the well-known Stefan-Boltzmann law. The heat capacity is 


„ tdE\ 32r*Vk ArY 
Cv = \df)v -I5~W/ ' 


(22-63) 
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The temperature dependence of E and CV is the same as that of a crystal 
at low temperatures, as we should expect from Sections 5-3 and 5-4 and 
the discussion following Eq. (22-58). The resemblance between Eqs. 
(5-33) and (22-60) for E should also be noted. Because of the relation 
between these two problems, the acoustical quanta in a crystal are often 
regarded as forming a phonon gas in analogy to a photon gas. 

Integration of Eq. (22-60) by parts leads to 

pV = f- (22-64) 


The reason why we get pV = E/3 here and not pV = 2E/3, as in earlier 
sections, is that this is an extreme relativistic system (e = cp instead of 
p 2 /2m, p = momentum). Since E/V is a function of T only, the pressure 
is a function of T only, as in a first-order phase transition. This result 
might have been expected since p — constant (=0). Equation (22-54) 
for the pressure in the Bose-Einstein condensation is similar. 

The radiation pressure p = E /37 is completely negligible in magni¬ 
tude except at extreme temperatures. For example (Problem 22-12), 
p — 1/5 atm at 10®°K (atomic bomb explosion). 

We handle Eq. (22-59) for N just as we did Eq. (22-60) for E and find 
for the equilibrium number of photons 

N = 16a*(3)y • (22-65) 


where f(3) = 1.202. Thus N is determined by V and T (or E and V, 
etc.). We can combine Eqs. (22-62), (22-64), and (22-65) to obtain 

ir*NkT 

E = ~~ = 2.70lNkT, (22-66) 

pV = | = 0.900 NkT. (22-67) 

The entropy follows from 


Np = 0 = E - TS + pV, 

E + pV 4 E 32tt *Vk(kT) 3 

T 3T 45 (Ac) 3 


( 22 - 68 ) 


Thus S « T 3 and <S = 0 at T = 0. 


22-5 Quantum statistics with intermolecular interactions. This is a 
very involved subject, so we must confine ourselves to a brief introduction 
to it. Further details will be found in S. M., Sections 8-11 and 16, and, 
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especially, in Hirschfelder, et al., Chapters 1, 2, and 6. Many of the 
contributions in this held are due to J. de Boer and his collaborators. 
Because of their complexity, we omit entirely any discussion of the many 
current attempts being made to attack the liquid helium and other quan¬ 
tum fluid problems exactly. 

The method of Sections 22-1 to 22-4 breaks down when intermolecular 
interactions are present, for the energy eigenvalues (eigenfunctions) of 
the whole system can no longer be written simply as a sum (product) of 
the energy eigenvalues (eigenfunctions) of the individual molecules in 
the system. We have to return instead to the completely general equa¬ 
tion (22-3), 

Q = J) e~ E,skT , (22-69) 

i 

where the sum is over all energy states accessible to the particular system 
under consideration. 

There are alternative but equivalent forms of Eq. (22-69) that are 
often useful. Let 4>j be the normalized energy eigenfunction belonging 
to the eigenvalue Ej. That is, X fy(g) = Etfjiq), where X is the Hamil¬ 
tonian operator for the system, 3C = 3C (kinetic energy operator) + U(q) 
(potential energy), and q represents all the coordinates. Then we can write 

Q = 'Ee- B i' kT fi'MHq) d q 
= £ hl(q)e- B>ltT Uq) dq. (22-70) 

i J 

Let us define the operator e~ xlkT by 

-X/kT _ , _ 3C 1 _ 

kT ' r 2 (fcr)2 

Then, since, 

x m Uq)*=E?Uq) (»»= 1 , 2 ,...), 

we have 

e^‘ kT Uq) = e- EilkT Uq), 

and Eq. (22-70) becomes 

Q = £/ m)e-^' kT Uq) dq. (22-71) 

i J 

Actually, there is a more general form of Eq. (22-71): 

Q = f <pi(q)e~ KlkT <Pn(q) dq, 


(22-72) 
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where the <p n are any complete set of orthonormal functions. To prove 
this, we expand <p n in the set of functions 

¥>n = E A *&i> 
i 

Anj — Jdq. 

Substitution of this expansion for <p n in Eq. (22-72) gives 

Q = E e ~* ilkT E 

j » 

= '22e~ BilkT . 
i 

The last step can be justified by expanding the function in the set v>» 
and then forming the integral dq = 1. Thus both Eqs. (22-71) and 
(22-72) are equivalent to Eq. (22-60) and hence are equivalent to each 
other. Equation (22-71) is the special case of (22-72) obtained when we 
choose the energy eigenfunctions for the set <p n - 
One quite general application of Eq. (22-72) is the following: it allows 
us to write the quantum-mechanical analog of the classical statistical- 
mechanical quantity e~ ulkT (recall that e~ vlhT is proportional to the 
configurational probability density). Consider, for example, a one- 
component system of monatomic molecules. In classical mechanics 
(Eq. 6-21) 

Q= f e ~ Uit)lkT dq- (22-73) 

Equation (22-72) also involves an integral over dq. Comparison of (22-72) 
and (22-73) shows that the analog of e~ vlkT is 

[S] = m\ 3rf E ^(7)e~ K/ * 7 V»(9) (22-74) 

n 

= ama 3 - v E +*(.q)Mq)e- EilkT - (22-75) 

i 

We call {S] the “Slater sum.” We can easily verify that [S] has the same 
physical significance as e~ vlkT by noting that 

-BjlkT 

—q -^*( 9 )^( 9 ) dq 

is the probability of observing the system in the energy state E, and also 
in a configuration between q and q + dq. If we sum this quantity over j, 
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we have the probability of observing a configuration between q and 
q + dq, irrespective of j. This sum over j is essentially that in (22-75). 

As a very simple example of Eq. (22-75) we can calculate [&] for two 
noninteracting particles in a one-dimensional box of length L. We find 
(Problem 22-13) 

[5] = 1 =F e - 2 '<*‘ -**>*'*’ (upper sign FD), (22-76) 

where x t and x 2 are the coordinates of the two particles. A negligible 
term in (x, + x 2 ) 2 has been dropped here. Clearly, there is an effective 
interaction between the particles when their distance apart is of order A 
(thermal de Broglie wavelength) or less, an attraction in the Bose-Einstein 
case and a repulsion in the Fermi-Dirac case. The leading term (unity) 
is the classical value ( h , A —> 0) for noninteracting molecules. 

For two particles in three dimensions (see Hirschfelder, et al., p. 402), 

t-Sf] = 1 (upper sign FD). (22-77) 


In view of Eqs. (22-72) and (22-74) this gives 
<?2 = 2 X« /[5] dr, di 2 

2A« 2»/*A» 


(22-78) 


The equations of Section 15-1 then lead to the second virial coefficient 


Bo — ± 


28 /* 


(22-79) 


in agreement with Eq. (22-38). 

In general, we can write for monatomic molecules [compare Eq. (6-21)] 


Qn = 


Zn 


where 


JV1A8W 

Zn = f[S\ dr,... dr at. 


(22-80) 


This is the quantum-mechanical configuration integral. Even if Un is 
pairwise additive, in general the quantity —kT In [5] (the quantum 
analog of Un in Zn) will not be. Hence quantum analogs of equations 
such as (15-30) for B s , B iy etc., do not exist (but we recall that the equa¬ 
tions of Section 15-1 are general and apply to the quantum case). 

It may be instructive if we also evaluate Q 2 (Eq. 22-78) for two non¬ 
interacting molecules in V using Eq. (22-69) directly. We are seeking just 
first-order quantum effects; for this purpose, we can replace sums by 
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integrals. First, we have from the energy-level expression (4-2) that Qi 
is given by (this is Problem 4-2) 

o.-EEE exp [-(Z 2 + it + l 2 ,)h 2 /8mV 2l3 kT] 

lx. h. i,z i 

= IJI exp [ ] dl x dly dl t = • 

o 

Now, to start on Q 2 we take the corresponding sixfold sum over all 
possible values of 4i> hu hi, h 2 , ly2, and hi' 

£ • • • £ exp [-(& +••• + l 2 ti)h 2 /8mV vz kT). 

lx . lx J-l 


The value of this sum (from integration) is obviously (F/A 3 ) 2 . In this 
sum we have counted, for example, the states 


and 


hi, • • • , hi — 4, 9,8,13, 3,3 
hi, • • •, ht — 13, 3, 3,4, 9, 8 


as separate states (with the same energy). Because of indistinguishability, 
this is wrong, and we divide the above sum by two, giving F*/2A 6 . But 
this correction is not proper for those states in which 


for example, 


hi 1 hi, hi — h2, hi, hi', 
hi, • • • ,1x2 — 4 , 9 , 8 , 4 , 9 , 8 . 


Such states should not be included at all in Fermi-Dirac statistics, and 
their contribution should not have been divided by two in Bose-Einstein 
statistics (because each occurs only once in the sixfold sum). We can 
compensate for these errors by adding an appropriate term. Thus for Q 2 
we have (upper sign Fermi-Dirac) 

O’-ro^jEE E exp [-2(& + l 2 j + /* i)h 2 /8m.V 2l3 kT]. 


Ill• ly I. It 1^1 


(22-81) 


On replacing the sums by integrals, we again get Eq. (22-78). 

The calculation of the quantum-mechanical second virial coefficient 
for interacting molecules (e.g., a Lennard-Jones 6-12 interaction potential) 
has been carried out for several monatomic gases, but the procedure is 
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too complicated to discuss here (see Hirschfelder, et al., Chapter 6). At 
low temperatures, Q 2 in the form Y,je~ Ei,kT was the starting point. At 
higher temperatures, Q 2 in the form of an integral of the Slater sum (Eq. 
22-80), using momentum eigenfunctions for the <p n , was the starting point. 
Figure 15-1 contains curves for the theoretical (high-temperature method) 
and experimental second virial coefficients of H 2 and He, using the 
Lennard-Jones potential with values of the parameters € and r* chosen 
to give best fit (these are the values in Table IV-1). Agreement is excellent. 
The curves for H 2 and He depart significantly from the classical curve. 

At high temperatures (slight quantum effects) the explicit equation for 
B 2 is found to be (upper sign FD) 

B 2 (T) = —2x f o [ e -^' kT - 1 ]r 2 dr ± ^ 

i ft 2 f —u(r)lkT (du\ 2 2 

247m (/cT) 3 J 0 \dr) 

+ 0(h*/m 2 ). (22-82) 


The first term is the classical B 2 (Eq. 15-24), and the second term (of 
order h 3 /m ai 2 ) is the quantum B 2 for noninteracting molecules (Eq. 22-79). 

It is appropriate at this point to discuss the modification in the law of 
corresponding states (Sections 15-2 and 16—1) that must be made for 
quantum fluids. We consider, as usual, monatomic gases with a pair 
interaction potential of the form w(r) = eh(r/r*) (this h is an arbitrary 
function and not Planck’s constant). The Lennard-Jones G-12 potential 
is an example. The virial expansion can be written 


po 

kT ~ 


1 + 


B 2 (D ( r* 3 \ Bz(T) ( r* 8 V 
r* 3 \ v / r* 6 \ v J 


+ •••• 


(22-83) 


In the classical case, we have already seen that B 2 /r* 3 , B 3 /r*°, etc., are 
universal functions of kT/e only, and hence pv/kT is a universal function 
of kT/e and v/r* 3 (law of corresponding states). In quantum statistics 
the thermodynamic expansion (22-83) of course still holds, but we observe 
on inspection of Eq. (22-82) that B 2 /r* 3 is now a function (different in 
Fermi-Dirac and Bose-Einstein statistics) of kT/e and also of the quantum 
parameter h/(me) ll2 r*, where h/(me) 1/2 is essentially the de Broglie 
wavelength for a molecule with energy e. This is true as well for the higher 
virial coefficients B 3 /r* 6 , etc. (as can be verified by dimensional analysis). 
Therefore pv/kT is a function, different in the two statistics, of the reduced 
thermodynamic variables kT/e and v/r* 3 , and also of the reduced quantum 
parameter h/(me) I/2 r*. These two functions approach each other and 
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become the universal classical function (Section 16-4) in the limit 
h/(mt) 1/2 r* —» 0. The B 2 /r* 3 curves for H 2 and He (both Bose-Einstein) 
in Fig. 15-1 do not coincide because they have different values of the 
quantum parameter (Problem 22-14). In fact, the three curves in 
Fig. 15-1 may be regarded as a Bose-Einstein family of curves correspond¬ 
ing to three different choices of the quantum parameter. 

Hirschfelder et al. give many more details about the quantum-mechanical 
law of corresponding states and its applications. 

The next section contains another application of the quantum equation 
for Q, Eq. (22-72). 

22-6 The factors A" and AH in classical statistics. In Chapters 3,4, and 
6 we found it necessary to introduce the factors A“ (n degrees of freedom) 
and iV! (N molecules) into the classical canonical ensemble partition func¬ 
tion [see, for example, Eq. (6-25)]. In Chapter 6, by appealing to special 
cases, we justified the use of A" as the volume in classical phase space cor¬ 
responding to a single quantum state. Here we give an argument, due to 
Kirkwood, which provides a general justification for this. The appro¬ 
priateness of the factor AH has already been demonstrated adequately 
in Sections 3-2, 6-2, and 22-1. In the case of AH there can be no doubt 
about the correct quantitative expression [whereas with A" we might 
have expected, say, (A/4ir) n , etc.]. In fact, we can deduce the factor AH 
simply by combining the quantum-mechanical idea of indistinguishability 
of identical particles with the classical phase integral (Eq. 6-18). But 
Kirkwood’s method can be extended (we omit this here) to provide a 
formed verification of the AH correction. 

We consider a system of N point particles with a potential energy U(r), 
where r stands for ... , r N . We use cartesian coordinates, even though 
for some systems other coordinates would be more natural. For example, 
the system might consist of two atoms, N = 2, forming a diatomic 
molecule, in which case the rotational (0, <f>), vibrational (ri 2 ), and trans¬ 
lational (x, y, z of center of mass) coordinates would be the conventional 
choice; but instead we employ = xj, y u z x and r 2 = x 2 , y 2 , z 2 . Our 
procedure is to obtain the desired result concerning A" with cartesian 
coordinates and then use the classical mechanical theorem, (VII-11), 
that an element of volume in phase space is invariant under a canonical 
transformation (q, p —> q', p')» to embrace any choice of coordinates. 

We start with Eq. (22-72), and use for the <p n (r) the normalized momen¬ 
tum eigenfunctions 

( 22 - 84 ) 

The normalization is such that if we expand the arbitrary function ^(r) 
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in momentum eigenfunctions, 

tw - mr, 

then by the inversion formula for the Fourier transform, 

•*<P> = p«7i /W'V-’""' 1 *. 

Whereas is the probability density for r, A* A is the probability 
density for p. Temporarily we ignore restrictions on the symmetry of 
wave functions. Substitution of Eq. (22-84) in Eq. (22-72) gives 

Q= hhL( e- 2 '* >r/ *e- 4C/tr e 2 ' <rr/ * dr dp. (22-85) 

It should be apparent that momentum eigenfunctions were chosen for 
the <p« in Eq. (22-72) because we now have, in (22-85), Q expressed as an 
integral over r and p—just as for the classical Q. Next, we define a func¬ 
tion w(r, p, T) by the equation 

e -sc/*V„>r/» = e-* /M, e 2 " >r/ * w(r, p, T). ( 22 - 86 ) 

If we put this in Eq. (22-85), we obtain 

Q = p/J e-" lkT w(r, p,T)dr dp. (22-87) 

Now all we have to show is that in the classical limit ( h —> 0), w —* 1. 

To do this, we go back to Eq. (22-86) defining to. For simplicity of 
notation, let us consider a one-dimensional (x) system here. Because we 
are using cartesian coordinates, the extension to any number of variables 
is very straightforward. The reader should verify this using two variables 
(x and y or xi and # 2 ). With the aid of the relations 

e -x/*r =1 ___ K + 2 \(kT)3 K * ’ 

K =-Si& +u M> 

d 2 V'**/* l 4*VY.*«p*/» 

dx2» ” V~ A* / 
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the left side of Eq. (22-8G) becomes 
e -XlkT e i.i*lk = C 2«px/^J _ + uj 

+ 2!(fc!T)2 \2m + U ) ^ J 

+ terms of order h 2 , h* . 

In the classical limit ( h —* 0), the terms of order h 2 , h*, etc., drop out, 
and there remains 


g—3ClkTg2ripz/h _^ g2wipxlhg—HjkT 


(h - 0 ). 


Therefore, from Eq. (22-86), 

w —► 1 as h —* 0. 


Kirkwood used a more elegant argument based on the so-called Bloch 
differential equation, but the result is the same. 

On referring back to Eq. (22-87), we see, then, that the factor h~ 3N in 
front of the classical phase integral is indeed correct, at least with cartesian 
coordinates. But in view of Eq. (VII-11), we can change variables from 
r, p to any set q', p' and still have 

Q =wLL e ~ H " tTd * dvr (a ^ o) - (22 - g8) 

This is what we set out to prove. 

We observe that Eq. (22-88) does not have the desired factor (IV!) -1 
in front of the integral signs. The reason is that we have ignored sym¬ 
metry restrictions on the wave functions [Eq. (22-88) is correct as it 
stands for a system of distinguishable particles—an Einstein crystal, for 
example]. Actually, the same kind of argument as above can be carried 
through, including symmetry effects (indistinguishable particles). But 
the argument is too sophisticated for the present text and we omit it 
(see S. M., pp. 85-89). The result is that h 3N in Eq. (22-88) is replaced 
by h 3N Nl 


22-7 Free-volume theories of quantum liquids. In Chapter 16 we 
discussed several approximate theories of classical liquids. Similar ap¬ 
proximations have been applied to quantum liquids. Because the theo¬ 
retical foundation is especially unsatisfactory when quantum effects are 
included in these theories, we shall merely sketch here what is perhaps 
the simplest procedure in converting a classical free-volume theory into 
a quantum theory. For further discussion of this general subject, see 
Prigogine (Chapter 18) and Band (Chapter 8). 
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To be specific, we consider a Bose-Einstein system. The same kind of 
argument will obviously apply to a Fermi-Dirac system. In Sections 
16-1 and 16-2, the transition from an ideal classical gas (no intermolecular 
interactions) to a real classical fluid was made by passing from the ideal 
gas equation (16-15), 

A , ve 
NkT ~ 111 A3 ’ 


to the “real fluid" equation (16-22), 

A _, n/(t>, T)e _ <p(v) 

NkT ~ m A® 2 kT' 


That is, we replace v by the free volume »/, and we add a potential energy 
term —<p(v)/2kT. We now follow exactly the same formal procedure for 
a Bose-Einstein fluid,* and for the same physical reasons. Thus, we first 
write the expression for the Helmholtz free energy of an ideal (no inter¬ 
actions) Bose-Einstein gas, from Eqs. (22-40), (22-41), (22-51), and 
(22-55): 


A 

NkT 


In + 0.4618 V -^ + 0.0112 
A3 t; L v J 


+ 0.00065* + •••, 


v > v 0 (T) (22-89) 


= 0.5134 


v 

Vo(T) ’ 


v < v 0 (T), 


(22-90) 


where we have included additional terms (see Problem 22-10) in the 
v > vo series and (Eq. 22-46) 

V ° (T) “ pdf) = 2^612' 

Next, we replace v by v/(v, T) and add the potential energy term 
—<p(v)/2kT: 


= ln^jff + 0.4618 — 
A 8 Vf 

+ 00112 (?)’+" 

2kT ^ V/ > V °^ 

(22-91) 

= 0.5134 — - 

v 0 2 kT 

(v/ < Vo). 

(22-92) 


Equations (22-91) and (22-92) now determine all the properties of the 
fluid once we adopt some definite model which provides v/ and <p. 


* T. L. Hill, J. Phyt. Chern. 51,1219 (1947). 
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The simplest example is the van der Waals model [Eqs. (16-4), (16-5), 
and (16-23)]: 


v/ — v — b, <p = 


2a, 

v 


b = 


2xr* 3 
3 ’ °* = 


2ir€r* 8 

3 


• (22-93) 


If we substitute these expressions for ty and <p in Eqs. (22-91) and (22-92), 
and then use p = —{dA/dV)s.T, we find for the equation of state 

& ~ (dry [i - o «is A - *«*> (j+s)* - • • •] 

(P>v 0 + b) (22-94) 


0.5134 _ _Oj_ 
v 0 v*kT 


(b < v < v 0 + b) 


= +00 (v < b). 


(22-95) 

(22-96) 


It is easy to verify (Problem 22-15) that pv/kT is a universal function of 
kT/e, v/r* 3 , and h/(me) ll2 r*, and hence that the quantum-mechanical 
law of corresponding states is obeyed (Section 22-5). It is also apparent 
that the above very simple theory gives for the second virial coefficient 
(upper sign Fermi-Dirac) 

(22-97) 

which should be compared with Eq. (22-82). 

We leave it as an exercise for the reader (Problem 22-16) to deduce 
further properties of the van der Waals-Bose-Einstein fluid. 


22-8 Gas of symmetrical diatomic molecules at low temperatures. In 
this section we consider a rather different type of application of quantum 
symmetry restrictions. A symmetrical (or homonuclear, which is a less 
confusing term in the present context) diatomic molecule is made up of 
two identical nuclei and one or more electrons. It is impossible, experi¬ 
mentally, to distinguish one of these nuclei from the other (when the 
molecule rotates), just as two identical atoms in a box are indistinguish¬ 
able. A wave function representing the state of a system containing 
homonuclear diatomic molecules must therefore be symmetrical or anti- 
symmetrical (depending on the nucleus) in the exchange of the two nuclei 
of any homonuclear diatomic molecule. The wave function must also be 
antisymmetrical in the exchange of any two electrons. A similar restric¬ 
tion obviously applies to more complicated molecules with equivalent 
and identical atoms, such as CH 4 , CDH 3 , etc., but we limit ourselves to 
the diatomic problem. 
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Clearly, there is no analogous symmetry restriction for the nuclei of 
heteronuclear diatomic molecules. Nothing essential needs to be added 
to the discussion of this case already given in Chapter 8 . 

Let us investigate a one-component homonuclear diatomic gas at suf¬ 
ficiently low density (Eq. 4-6) so that classical statistics can be used for 
the translational motion. Then, as in Eq. (22-1), Q = q^/Nl, where q 
is the partition function for a single molecule in a box of volume 

V. In this special case we need consider only the energy eigenvalues ey 
and eigenfunctions 4y of a single molecule. We are to include in the 9 -sum 
only those molecular energy eigenstates with proper symmetry (e.g., states 
which are antisymmetrical in the exchange of the two nuclei for H 2 and 
symmetrical for D 2 , since a hydrogen nucleus contains an odd number of 
nucleons and a deuterium nucleus an even number). 

We have already seen in Section 8-3 that the above symmetry restric¬ 
tions can be ignored and a classical treatment of rotation can be used if 
T » 0 r . Our problem here is to provide a formulation which is suitable 
at low temperatures, that is, when T > > 9 r and the classical approach 
breaks down. In order to reach the range of temperatures at which quan¬ 
tum effects would show up, very low temperatures are needed (see Table 
8-1) except for H 2 (© r = 85.4°K) and D 2 (9 r = 42.7°K). 

As in Eqs. (4-41) and (8-1) through (8-3) we write, for a homonuclear 
diatomic molecule, 


H = H t + Hr + H v + H e + H n , 

e = «< + (r +«» + *« + «n> 

We consider only the ground electronic and nuclear states, for reasons 
already discussed in Chapters 4 and 8 . We now examine the symmetry 
of 4 to see which energy eigenstates are to be included in the 9 -sum. The 
ground state electronic function \f/ e is necessarily antisymmetrical in the 
electrons. Usually (for example, H 2 ), the ground state 4> t is also sym¬ 
metrical in the nuclei: this is the only case we consider. The function 
(which depends only on the coordinates of the center of mass) is not 
affected by exchanging nuclei, nor is \f/ v (which depends only on the inter- 
nuclear distance). Hence both of these functions are symmetrical in the 
nuclei. Therefore 4 has the nuclear symmetry of the product If 
the nuclei have an odd (even) number of nucleons (“mass number”), 
^ and ^v 4 >» must be antisymmetrical (symmetrical) in exchange of the 
nuclei. 

Let * n be the nuclear spin (in units of h/2ir). For example, s» = § 
for H, 8 „ = 1 for D, «« = 0 for He 4 , etc. Then the nuclear ground state 
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degeneracy of each nucleus is w„i = 2s n + 1. Let the nuclear ground 
state energy eigenfunctions be ^j, ^ 2 , • • •, If we denote the two 
(identical) nuclei by A and B, a function of the form 4'i(A)rff 2 (B) — 
4'i(B)i 2 {A) is a possible antisymmetrical nuclear function for the 
diatomic molecule (if A and B are interchanged, the function changes 
sign). Such a function exists for each pair of different numbers 12, 13, 
..., lw„i, etc. Altogether there are «»i(«„i — l)/2 such pairs and 
therefore this number of antisymmetrical nuclear states for the molecule. 

The function ^i(A)^ 2 (B) + >h(B)^ 2 (A), obtained simply by changing 
the sign in an antisymmetrical function, is symmetrical in the exchange 
of the two nuclei A and B. Other symmetrical functions are ^i(A)^i(B), 
$ 2 (A)<fr 2 (B), etc. Thus there are a total of 

«nl(«»l — 1) , .. «nl(«>tl + 1) 

-2- + Wnl “-2- 

symmetrical nuclear states. 

The total number of nuclear states for the molecule is 

«nl(«nl — 1) , «nl(«»l + 1) . 2 

-o- 1 -o-“nl. 


as one should expect. 

Now we turn to <p T . Aside from normalization, we have* 


h(e,<p) = e‘ m *P' m| (cos 8), 


in rather standard notation. The associated Legendre functions can be 
defined by 


p'r'ix) = a - xy mu2 d ' m ^ x) . 



d\x 2 - 1)' 
dx> 


If we interchange nuclei, 9 becomes v — 9 and <p becomes <p + ir. Thus 
x — cos 9 becomes —x. Then we note that 

p/-x) = (-i ypj(x) 

and 

Pf'(-ar) = (—l)'~ |m| P}’’ ,| (z). 


* See any text on quantum mechanics. 
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Also, 

e «»»(»+») _ e wy'»* _ ^ 


Therefore, finally, 

Mir - e, <p + t) = (-l) i - |m| (-l) ,M W(», i») 


We conclude that is antisymmetrical in exchange of the nuclei if j is 
odd and symmetrical if j is even. 

Putting these results together, we have the following situation. If the 
nuclei of the homonuclear diatomic molecule contain an odd (even) num¬ 
ber of nucleons, ^ and must be antisymmetrical (symmetrical) in 
exchange of the nuclei. For rotational states withy even, is symmetrical, 
and hence only antisymmetrical (symmetrical) nuclear states are accessi¬ 
ble—in order to make the product Mn antisymmetrical (symmetrical). 
On the other hand, forj odd, is antisymmetrical, and only symmetrical 
(antisymmetrical) nuclear states are accessible in order to make 
antisymmetrical (symmetrical). We therefore have 

9 = 9t9*9*9m, (22-98) 

with (see Eq. (8-23)] 

Qrn - £ (2j + 

i- o.a,... 


+ W m(^ + 1) £ (2j + l)e -iU + X» rl T (0ddmassn0t)> 

''' (22-99) 


9m — 


<0nl(<*nl + 1 ) 


£ 

j—0,2, . 


+ 


Wnl(<dnl ~ 1) 


£ 

y-i.#.... 

(even mass no.), (22-100) 


where g m is the combined rotational and nuclear partition function which 
cannot here be factored into q r and q„. 

If «„ = 0 (which occurs only with even mass number), as for example 
in w»i = 1 and 

9m = 2 • 

i—0.2.. . . 

In this case odd rotational states do not appear at all. This has been 
verified spectroscopically. 
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For a heteronuclear diatomic molecule XY (see Section 8-3), the equa¬ 
tion corresponding to Eqs. (22-99) and (22-100) is 

9m = 9n9n 

9» = 9r = £ (22-101) 

i-0,1,2.... 


At high temperatures (Eq. 8-25), Eqs. (22-99) and (22-100) both 
reduce to 

~ 1) i Mnl(<«)nl + 1) 11 T 
9rn ~ L 2 + 2 J2 e r 


_ <*nlT 

20 r 


(homonuclear), 


( 22 - 102 ) 


in agreement with Eq. (8-29) with a = 2. In the same limit, Eq. (22-101) 
becomes 

9m = (heteronuclear). (22-103) 

“r 

When the classical forms for q n , (22-102) and (22-103), can be used 
(which is almost always the case), q n is separable and constant (the product 
of the two nuclear spin weights) and is ordinarily omitted, as already 
explained in Sections 2-4 and 4-4. 

We confine ourselves in the remainder of this section to the special 
case of hydrogen gas (see also Problem 22-17 concerning deuterium 
gas, D 2 ). A hydrogen nucleus has odd mass number (one proton) and 
s„ = i, «ni = 2 (spin “up” or “down”). Therefore Eq. (22-99) becomes 

9m = 1 £ +3 £ . (22-104) 

y-o, 2 .... y-i.8_ 


Table 22-2 

Equilibrium Percent p-H 2 in H 2 Gas 


T, °K 

%P-H 2 

0 

15 

30 

50 

100 

200 

298.1 

100 

99.989 

96.95 

76.8 

38.5 

25,953 

25.074 
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Hydrogen molecules in even rotational states (j = 0,2,...) are con¬ 
ventionally referred to as parahydrogen, or p- H a , and those in odd rota¬ 
tional states are called orthohydrogen, o-H a (Problem 22-18). 

The thermodynamic properties of hydrogen gas, at equilibrium, are 
determined by Q = q^/Nl and q = MWWm, with q n given by Eq. 
(22-104). At the temperatures of interest here (10°K-300°K), translation 
is classical, while the vibrational and electronic degrees of freedom are 
completely unexcited. 

The ratio of the number of ortho- to parahydrogen molecules, at equi¬ 
librium, is obviously 

No _ 3£j-l,8.. , ■ _ 

Np I2j—0,2, 


The high-temperature limit is N 0 /N p —» 3, and the low-temperature limit is 

No Z(Ze~ w ' IT + • • •) 
n p 1(1 + 5e~ M,IT H-) 

That is, the equilibrium mixture approaches 100% para (j = 0) as 
T~> 0 (Table 22-2; Problem 22-19). 

The rotational-nuclear contribution to the heat capacity, Cvm, can be 
calculated from Eq. (22-104) after two differentiations with respect to 
temperature [see, for example, Eqs. (8-31) and (8-33)]. The curve ob¬ 
tained is labeled e-H a (e for equilibrium) in Fig. 22-7. Also included in 



Fio. 22-7. Rotational-nuclear contribution to the heat capacity for 0 -H 2 , 
p-H a , e-H 2 (equilibrium mixture at each temperature) and f 0 -H 2 (labeled 
“experimental”). 
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the figure are curves for pure p-H 2 and pure o-H 2) calculated from 

para: q n = 1 2 

i-0.2,... 

ortho: 3r» = 3 £ 

i-i.a, • • • 

The experimental curve for H 2 , shown in the figure, does not agree with 
any of these! This interesting puzzle was resolved by Dennison, who 
realized that, in the absence of a catalyst, the half-life for the interconver¬ 
sion between ortho and para states is very long compared with the time 
of an experiment. Therefore when ordinary hydrogen gas is cooled down 
from room temperature for low-temperature heat-capacity measurements, 
the high-temperature composition (£ o-H 2 ) obtains even at low tempera¬ 
tures, instead of the equilibrium composition (Table 22-2). The gas is 
not in a state of complete equilibrium (as assumed in Table 22-2), but is 
in a state of “frozen” metastable equilibrium with Jo-H 2 . (See Sections 
2-3 and 2-4 in this connection.) If we calculate a heat-capacity curve 
based on f o-H 2 at all temperatures, using 

Cvm = fCVm (ortho) + iCvn (para) 

and the ortho and para curves already included in Fig. 22-7, we find a 
theoretical curve that is in excellent agreement with the experimental 
curve shown in the figure. 

The above analysis is elegantly confirmed by an experiment due to 
Bonhoeffer and Harteck: active charcoal is found to be a catalyst (the 
mechanism involves adsorbed free H atoms) for the ortho-para conversion; 
in the presence of a catalyst, points on the equilibrium, curve (labeled 
c-H 2 ) in Fig. 22-7 are obtained experimentally. 
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Problems 

22-1. Explain how symmetry restrictions on accessible quantum states apply 
to multicomponent systems. (Page 432.) 

22-2. Construct a table analogous to Table 22-1 for a system of four particles 
with a total of 12 energy “units.” (Page 433.) 

22-3. Expand the determinant in Eq. (22-6), and show that the wave func¬ 
tion is symmetrical. (Page 435.) 

22-4. Use the microcanonical ensemble and the method of undetermined 
multipliers to derive the Fermi-Dirac distribution law, (22-12). To avoid 
applying the Stirling approximation to small numbers, the quantum states 
should be treated in large groups with essentially the same energy. (Page 437.) 

22-5. Show that the entropy of an ideal quantum gas can be written in the 
form (upper sign Fermi-Dirac) 

S = -*£ [Sy In 3) ± (1 =F Sy) In (1 =F 8y)J. 
i 

(Page 440.) 

22-6. Derive an equation for the equilibrium vapor pressure of electrons in 
a metal assuming the gas phase is very dilute (classical statistics; ignore space 
charge) and using the low temperature limit for p in the metal. Note that it is 
necessary to introduce the potential <p in this problem. (Page 441.) 

22-7. Show that X,u> <K 1 for the electrons in a typical metal only above 
about 10 6o K. (Page 442.) 

22-8. Calculate a numerical value (in ev) for the Fermi energy po of a typical 
metal. Show in an energy-level diagram the relation between po and <p (electron 
at rest in gas phase = zero of energy). (Page 442.) 

22-9. Use Eqs. (22-30) and (22-31) for an electron gas to derive equations 
for the thermodynamic functions S, pV, and Np. (Page 444.) 

22-10. Extend the series (22-38) through (22-42) one more term for the Bose- 
Einstein case. (Page 445.) 

22-11. Derive the equation 

m 

for a monatomic gas from the virial expansion and show its equivalence with 
Eq. (22-37) for an ideal quantum gas. (Page 446.) 

22-12. Show that the radiation pressure p = E/ZV is about 1/5 atm at 
10 5 °K. (Page 456.) 

22-13. Use Eq. (22-75) to derive Eq. (22-76) for the Slater sum of two non¬ 
interacting particles in a one-dimensional box. Note that N\ A 3Ar in Eq. (22-75) 
becomes 2!A 2 in this case. (Page 459.) 

22-14. Use the values of t and r* in Table IV-1 to calculate the quantum 
parameter h/(me) 1,2 r* for H 2 , He 3 , He 4 , A, and Xe. (Page 462.) 

22-15. Show that the van der Waals-Bose-Einstein equation of state, (22-94) 
through (22-96), obeys the quantum-mechanical law of corresponding states. 
(Page 466.) 
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22-16. Investigate the nature of the phase transition and Cv(T) for the 
van der Waals-Bosc-Einstein fluid. (Page 466.) 

”22-17. Give a discussion of the Dj case analogous to that in the text for H 2 . 
(Page 470.) 

22-18. Write out the four ground state nuclear wave functions for H 2 . Classify 
them as ortho and para and as symmetrical or antisymmetrical in exchange 
of the nuclei. (Page 471.) 

22-19. Calculate the equilibrium percent of p-Kfe in H 2 gas at 30°K. (Page 471.) 

22-20. Investigate the properties of one- and two-dimensional ideal Bose- 
Einstein gases. 

22-21. Investigate the properties of the following ideal two component 
systems: BE-BE, BE-FD, FD-FD. 

22-22. Derive an expression for the radial distribution function in an ideal 
quantum gas in the limit p —» 0. 

22-23. (a) Investigate the fluctuation in N in an isolated photon gas (E and 
V given). Compare Section 10-3. (b) Consider the same question for an iso¬ 
thermal system (T and V given). 

22-24. Derive equations for the fluctuation in sj in systems of noninteracting 
Bosc-Einstein and Fermi-Dirac particles. Compare Section 7-2. 

22-25. Verify that the Clausius-Clapeyron equation, dp/dT ■=» AS/AV for a 
first-order phase transition, where p is the vapor pressure, is satisfied in the ideal 
Bose-Einstein condensation. 

22-26. Consider the following hypothetical system with hybrid “quantum” 
statistics. The particles do not interact in the usual sense, and any number of 
particles can be in a single quantum state. But when 8 particles are in the same 
state «, there is a new kind of “interaction” energy, (* — l)w, which favors or 
disfavors more than one particle occupying the same state. The 8 particles 
contribute a total of ee + (s — l)w to the energy. In Bose-Einstein statistics, 
w =» 0; in Fermi-Dirac statistics, w = +00 . Consider the general case, for a 
gas, where w is finite (positive or negative) and investigate the thermodynamic 
properties of the system. Use the grand partition function method of Eqs. 
(22-10) and (22-15). Note the formal resemblance to B.E.T. adsorption theory 
(Eq. 7-36). 

*22-27. Derive equations for the asymptotic low-temperature heat capacity 
Cm for 0 -H 2 and P-H 2 . Calculate Cr™ at 60°K in the two cases. 
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Supplementary Reading 

Band, Chapters 8,10 and 11. 

Fowler and Guggenheim, Chapters 1 and 11. 
Hirschfeldeb, Curtiss, and Bird, Chapter 6. 

Kitted, Part 1. 

Landau and Litshitz, Chapters 5-7, Section 33. 

London, F., Superfluids, Vol. II. New York: Wiley, 1954. 
Mayer and Mayer, Chapters 2, 5, 7 and 16. 

Prigogine, Chapters 18 and 19. 

SchrSdinger, Chapters 7-9. 

Slater, Chapters 5,19, 27-29. 

S. M., Chapters 2 and 3. 

ter Haar, Chapters 3, 4, 7-10. 
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APPENDIX I 

NATURAL CONSTANTS 


Quantity 

Symbol 

Value 

Avogadro’s number 


6.02486 X 10 23 mole- 1 

Velocity of light 

c 

2.997930 X 10 10 cm-sec- 1 

Electronic charge 

e 

4.80286 X 10 _1 ° esu 

Electron rest mass 


9.1083 X 10- 28 gm 

Planck’s constant 

h 

6.62517 X 10" 27 erg-sec 

Mass of hydrogen atom 


1.67330 X 10" 24 gm 

Mass of unit atomic weight 


1.65979 X 10 -24 gm 

Mass of proton 


1.67239 X lO" 24 gm 

Boltzmann’s constant 

k 

1.38044 X 10 -16 erg-deg -1 

Gas constant 

R - iVok 

8.31696 X 10 7 erg-deg _1 -mole _1 

Temperature scales 


0°C = 273.15°K 

1 cal — 4.184 joules « 4.184 X 10 7 ergs 

1 ev - 1.60206 X 10~ 12 erg = 23.0693 kcal-mole' 1 

1 atm =» 1.0133 X 10® dynes- 

cm -2 = 760 mm Hg 
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